Method and apparatus for processing mail to obtain image data of contents

ABSTRACT

A method and apparatus for processing mail is provided. Mail is placed into an input bin having a conveyor that conveys the mail towards a feeder. The feeder serially feeds the envelopes by engaging the lead envelope in the stack of mail and displacing the lead envelope transverse the stack of mail. The mail is then cut on a side edge and the top edge to cut open each envelope. A transport conveys the cut envelopes to an extractor. The extractor opens the edge-severed mail and presents the contents of the envelopes to an operator who manually extracts the contents. The operator drops the extracted contents onto a conveyor that conveys the contents to an imaging station. The contents are automatically separated and imaged to obtain image data for the contents. The contents are then sorted into a plurality of output bins.

PRIORITY CLAIM

This application is a divisional of U.S. patent application Ser. No. 11/651,674, filed on Jan. 10, 2007 now U.S. Pat. No. 7,992,853 which is a continuation-in-part application of U.S. patent application Ser. No. 10/931,529 now U.S. Pat. No. 7,916,892 and Ser. No. 10/931,530 now U.S. Pat. No. 7,706,914, both filed on Aug. 31, 2004, each of which is a divisional application of U.S. patent application Ser. No. 10/862,021, filed Jun. 4, 2004 set to issue as U.S. Pat. No. 7,537,203. This application also claims priority to U.S. Provisional Patent Application No. 60/476,532, filed Jun. 7, 2003 and U.S. Provisional Application No. 60/757,929 filed Jan. 11, 2006. Each of the foregoing applications is hereby incorporated by reference.

FIELD OF THE INVENTION

The present invention relates to the field of processing documents to obtain image data for the documents. More specifically, the present invention relates to a workstation operable to process documents by dropping the contents onto a conveyor that conveys the documents to a scanner. The present invention also relates to processing mail, such as envelopes having contents, wherein opened envelopes are presented to an operator so the operator can extract the contents from the envelopes.

BACKGROUND OF THE INVENTION

Automated and semi-automated machines have been employed for processing documents such as bulk mail. Due to the large quantity of mail received by many companies, there has long been a need for efficient sorting of incoming mail.

Further, in many instances it is desirable to obtain image data of the documents received in the mail. However, often the documents are folded and/or the documents received in a batch of mail are random sizes. In order to scan these documents, it is typically necessary to spend considerable time to manually prepare the documents so that the documents can be automatically fed into a scanner. This processing is inefficient, thereby increasing the overall cost of processing the mail and obtaining image data of the documents in the mail.

SUMMARY OF THE DESCRIPTION

Accordingly, a method and apparatus are provided for processing mail to efficiently handle the mail and obtain image data of the documents. Preferably, the present invention provides an apparatus that readily processes a variety of document types to obtain image data for the documents. For instance, preferably the apparatus accepts various sized documents and transports the documents to an imaging device, such as a scanner, to obtain image data. Preferably, the apparatus also allows the processing of folded or creased documents, and particularly groups of folded or stacked documents.

A method and apparatus are also provided that allow processing mixed mail with minimal or no preparation of the mail. A document or packet of documents can simply be dropped onto a conveyor, which conveys the document or documents to a feeder that serially feeds the documents downstream for further processing. One type of further processing uses a scanner. The feeder separates and serially feeds the documents to the scanner, which scans the documents to obtain image data for each document.

According to another aspect, a method and apparatus is provided for efficiently processing documents according to various information that can be identified for the documents on a job, batch, transaction, or document basis. Further, an interface is provided that allows the operator to intervene to supplement or amend the scanned information with additional information that may be tagged to the documents.

Yet another aspect relates to a feeder that is operable to accommodate mail with little or no pre-processing. The feeder is operable to receive packets of envelopes conveyed in a generally horizontal orientation, separate the documents from one another and serially feed the documents. The feeder includes a feed belt disposed generally parallel to a transport path along which the packet of documents are conveyed. The feeder also includes an angled feed arm that projects upwardly and outwardly over the transport path, forming an acute angle with the incoming transport path.

A method and apparatus are also provided for processing documents having detachable portions. For instance, a document processing apparatus comprising a document feeder, a scanner and an image processor are provided. The feeder is operable to feed a document having first and second portions connected by a perforation. The scanner is operable to scan the document to obtain a document image comprising image data for the first and second portions. The image processor is operable to analyze the image data to identify a characteristic of the document, and determine the location of the perforation based on the characteristic. In addition, the image processor may be configured to crop the document image based on the location of the perforation to remove the second portion of the document from the document image.

A method is also provided for processing documents having a first portion and a second detachable portion attached to the first portion by a perforation. The document is scanned to obtain a document image comprising image data for the first and second portions. The image data is analyzed to identify a characteristic of the document, and the location of the perforation is determined based on the characteristic. The document may then be cropped based on the location of the perforation, thereby removing the second portion of the document from the document image.

Further a method is provided for processing a batch of documents comprising a plurality of transactions of payment documents for a plurality of payees, wherein the transactions are payments from a payor to a payee comprising at least one check. The payment documents are scanned to obtain image data for the documents, and the payor account information is identified for the checks. The image data is analyzed to identify the payee for the transactions and to identify the payment amount for the checks in the transactions. Electronic payment records for the checks are prepared, wherein each payment record for one of the checks comprises the payor account information, the payment amount and the image data for the check. The electronic payment records are then submitted for clearing the checks on either a check by check basis or a transaction by transaction basis.

A method and apparatus are also provided to track documents to improve the efficiency of identifying a characteristic of a particular document. Specifically, an apparatus is provided, which includes a conveyor for conveying documents, which includes a first portion for receiving documents having a first characteristic and a second portion for receiving documents that do not have the first characteristic. The apparatus further includes a scanner for receiving documents from the conveyor. The scanner is operable to scan the documents to obtain image data for the documents. First and second sensors are used to track the documents. The first sensor is operable to detect the presence of documents received on the first portion of the conveyor; the second sensor is spaced apart from the first sensor and is operable to detect the presence of documents as the conveyor conveys the documents from the first portion of the conveyor to the scanner. A processor analyzes the progress of the documents in response to signals received from the sensors. Specifically, the processor analyzed the time that a signal is received from the first sensor regarding a first document and the time that a signal is received from the second sensor regarding the first document. In response to the analysis of the progress of the first document, the first document is selectively electronically tagged indicating the presence of a characteristic.

In addition, an apparatus is provided, which includes a conveyor operable to receive and convey documents and first and second sensors positioned along the conveyor. A scanner positioned adjacent the conveyor receives documents conveyed by the conveyor and scans the documents to obtain image data for the document. The apparatus further includes a processor operable to control the processing of the documents in response to signals received from the first and second sensors. The processor is operable to determine the time that signals are received from the first and second sensors, and selectively electronically tag documents in response to a signal from the first sensor indicating that the first sensor sensed the presence of the document and a signal from the second sensor indicating that the second sensor sensed a portion of the document within a select period of time.

A method for processing documents is also provided, which includes dropping a first document onto a first area, and conveying the first document from the first area to a scanner. At the scanner the first document is scanned to obtain image data for the first document. A second document is dropped onto a second area to identify the second document with a characteristic. The second document is then conveyed from the second area to the scanner, which scans the second document to obtain image data for the second document. The document is tracked along a document path between the second area and the scanner to verify that the document arriving at the scanner corresponds to the document dropped in the second area.

In addition, a method for processing documents in provided, which includes dropping a document in a first area and sensing the presence of the document in the first area with a first sensor. The time the first sensor detects an edge of the document is recorded and the document is conveyed away from the first area and past a second sensor. The time the second sensor detects an edge of the document is also recorded and the document is scanned to determine a characteristic of the document. In addition, the document is selectively electronically tagged as having a pre-determined characteristic in response to signals from the first sensor indicating the presence of the document in the first area and the time of recording the second sensor detecting the edge of the document. The document is then processed in response to whether the document is electronically tagged.

DESCRIPTION OF THE DRAWINGS

The foregoing summary as well as the following description will be better understood when read in conjunction with the figures in which:

FIG. 1 is a perspective view of a mail processing apparatus.

FIG. 2 is a fragmentary perspective view of a feeder section of the apparatus illustrated in FIG. 1.

FIG. 3 is a fragmentary plan view of the drop feeder and imaging station sections of the apparatus illustrated in FIG. 1.

FIG. 4 is a fragmentary cross-sectional view of the drop feeder and imaging station sections illustrated in FIG. 3.

FIG. 5 is an enlarged fragmentary view of the drop feeder and imaging station sections illustrated in FIG. 4.

FIG. 6 is a schematic side view of the drop feeder and imaging station sections illustrated in FIG. 3.

FIG. 7 is a schematic plan view of the drop feeder and imaging station sections illustrated in FIG. 3.

FIG. 8 is a fragmentary perspective of the drop feeder and imaging station sections illustrated in FIG. 3, illustrating the imaging section in an opened configuration.

FIG. 9 is an enlarged fragmentary side view of an extraction station of the apparatus illustrated in FIG. 1.

FIG. 10 is a schematic side view of an uncreasing element of the imaging station illustrated in FIG. 3.

FIG. 11 is a screen shot of an operator display of the apparatus illustrated in FIG. 1, illustrating a view sorted according to transaction.

FIG. 12 is a screen shot of an operator display of the apparatus illustrated in FIG. 1, illustrating a view sorted according to document sequence.

FIG. 13 is a screen shot of an operator display of the apparatus illustrated in FIG. 1, illustrating a view sorted according to bin.

FIG. 14 is a screen shot of an operator display of the apparatus illustrated in FIG. 1, illustrating a view sorted according to transaction.

FIG. 15 is a fragmentary perspective view of the drop feeder illustrated in FIG. 3.

FIG. 16 is an enlarged fragmentary perspective view of the feeder of FIG. 2.

FIG. 17 is an illustration of a check document having detachable portions.

FIG. 18A is an illustration of the check document in FIG. 17, illustrating scan areas for the document in alternative orientations.

FIG. 18B is an illustration of the check document similar to the document in FIG. 17, illustrating scan areas for the document in alternative orientations.

FIG. 18C is an illustration of a check document, overlying the scan areas illustrated in FIGS. 18A and 18B.

FIG. 19 is a plan view of a portion of the apparatus illustrated in FIG. 1, illustrating features of a document tracking system.

FIG. 20 is a flow chart illustrating the steps of a document tracking method.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT

Referring now to the figures in general and to FIG. 1 in particular, a semi-automated mail processing workstation 10 is illustrated. The workstation 10 processes mail by severing one or more edges of each envelope in a stack of mail, and presenting the edge-severed envelopes one at the time to an operator who removes the documents from the envelope by hand. The operator can then drop the extracted documents individually or in stacks onto a conveyor that conveys the documents to an imaging station. The imaging station separates the documents, serially feeding the documents to an imager that obtains image data for the documents. The documents are then sorted into one or more output bins.

Brief Overview

A general overview of the flow of mail is as follows. Initially, a stack of envelopes containing documents, referred to as a job, is placed into an input bin. A feeder 30 removes the lead envelope 5 from the front of the stack and transfers the envelope to a feed tray.

The envelope 5 in the feed tray is edge-justified by a plurality of opposing rollers. From the feed tray, the envelope 5 drops into a side cutter, which severs the side edge of the envelope if desired. From the side cutter, the envelope drops into a shuttle. The shuttle moves vertically to adjust the height of the top edge of the envelope to account for variations in the height of the different envelopes in the job. The shuttle moves vertically until the height of the top edge of the envelope 5 is within an acceptable range for advancing the envelope into a top cutter. The envelope is then transported to the top cutter, which severs the top edge of the envelope 5.

From the top cutter the envelope is advanced to an extraction station 70. The extraction station 70 pulls apart the front and back faces of the envelope to present the contents of the envelope for removal. An operator then manually removes the contents from the envelope 5.

After the operator removes the documents from the envelope 5, the apparatus 10 automatically advances the envelope to a verifier 90. The verifier 90 verifies that all of the documents were removed from the envelope before the envelope is discarded. From the verifier 90 the envelope is conveyed into a waste container. Alternatively, the envelope 5 may be manually removed and imaged at the imaging station 130.

Returning to the flow of the documents, after the documents are extracted at the extraction station, the operator unfolds as needed and drops or places the extracted documents onto a drop conveyor 100 that transports the documents toward an imaging station 130. An imaging entry feeder 110 receives the documents from the drop conveyor 100 and controls the feeding of the documents into the imaging station 130. The image entry feeder 110 is configured to receive and feed documents of various sizes and condition. For instance, frequently documents are folded in an envelope. When the documents are extracted and opened up, the documents are creased or folded so that they do not lie flat. The feeder 110 is preferably configured to receive such creased or folded documents and serially feed the folded documents into the imaging station 130 with minimal manual preparation by the operator.

The imaging station 130 includes an imager 150 that obtains image data for each document as the document is conveyed past the device. For instance, preferably the imager 150 is a scanner that obtains gray scale or color image data representing an image of each document. The scanner scans each document at a plurality of points as the document is conveyed past the scanner. The information for each document is stored in a data file for each document so that the image data can be accessed at a later time.

From the imaging device, preferably an imaging transport 170 conveys the documents to a sorting station 200 that sorts the documents into a plurality of output bins 205. The documents can be sorted in a variety of ways. For instance, the documents can be sorted based on document information obtained from the image data received at the imaging station 130. Alternatively, the operator may indicate information regarding a document before it is scanned, so that the document is sorted according to the information indicated by the operator. Yet another alternative is that the documents may be stacked into one or more bins simply based on the order in which the documents are processed.

Since many of the documents may be creased, ordinarily the documents will not readily stack in a compact manner so that relatively fewer creased documents can be discharged into a bin before the bin is full. Accordingly, preferably, the documents are processed by an uncreaser 180, which is an element that reduces the creasing or folds in the documents. The uncreaser flattens or straightens the documents so that they lay more flatly in the output bins so that more documents can be discharged into a bin before the bin is full.

A controller controls the processing of the mail in response to signals received from various sensors at various locations of the workstation 10 and in response to parameters set for the job by the operator. For instance, in response to an indication from a sensor in the feed tray that there is no envelope in the feed tray, the controller sends a signal to the feeder 30 indicating that an envelope should be fed from the input bin to the feed tray. Similarly, in response to an indication from a sensor in the shuttle that there is no envelope in the shuttle, the controller sends a signal to the feed tray indicating that an envelope should be dropped from the feed tray into the shuttle.

The workstation is divided into numerous functionally separate sections, which include: a feeding station 30, a side cutting station, a top cutting station, the extraction station 70, the verification station 90, the imaging station 110, and the sorting station 200. In most cases, the controller controls the operation of the various sections independently from each other. This independence allows several operations to proceed simultaneously or asynchronously as required. As a result, a slow down in one section does not necessarily slow down all of the other sections.

In addition, preferably the operation of the apparatus from the drop conveyor through the sorting station are controlled separately from the operation of the other stations. Further, preferably, an operator interface is provided so that the operator can intervene to control the processing of the documents. Specifically, preferably a touch screen display 20 is provided that allows the operator to enter various information regarding the documents.

Configuration of the Work Station

As can be seen in FIG. 1, preferably the work station 10 is configured so that an operator working at the workstation has ready access to each working area. A seating area 15 at the front of the apparatus is centrally located, and the different stations are disposed around the seating area with the paper path flowing in a manner that the documents remain within easy access of the operator at the seating area.

Specifically, preferably, the feeding station 30 is disposed adjacent the right side, however, the feeding station can be located on the left side if desired. Preferably, the feeding station is within arm's reach of the operator from the seating area 15. Accordingly, preferably the distance from the seating area to the feeding station is no longer than one half the overall width of the work station, so that the operator can readily access the feeding station 30 from the seating area. From the feeding station 30, the mail pieces are fed along a document path that extends across the workstation along the width of the work station, intermediate the front and rear edges of the work station to the extraction station 70. Preferably, the extraction station is substantially aligned with the seating area 15 relative to the right and left edges of the work station so that the operator can readily grasp the mail at the extractor during operation. For instance, preferably the extraction station is generally centered between the right and left edges of the workstation, and preferably the center of the seating area is also generally centered between the right and left edges of the work station. In other words, preferably the seating area is disposed a distance from the left hand edge of the work station that is substantially similar to the distance that the extraction station is disposed from the right hand edge of the work station.

The drop conveyor 100 is preferably located adjacent the front edge of the work station, and is disposed between the extraction station 70 and the seating area 15 so that the operator reaches over a portion of the drop conveyor to grasp documents at the extraction station. More specifically, preferably a portion of the drop conveyor 100 is disposed adjacent the seating area 15 at the front edge of the work station. In this way, the operator can readily view, unfold and drop documents from the extraction station 70 onto the drop conveyor 100 when pulling the documents back toward himself/herself.

Preferably, the imaging station is disposed adjacent the front edge of the work station on the side of the work station opposite the side on which the feeding station 30 is disposed. Accordingly, the drop conveyor 100 conveys the dropped documents away from adjacent the seating area 15, along a path that is generally parallel to the front edge of the work station. Preferably, the imaging station 130 is disposed a distance from the seating area that is less than half the width of the work station so that the operator may readily access documents at the imaging station. In addition, preferably the document path from the imaging station 130 to the output bins 205 returns toward the seating area so that the output bins are spaced from the seating area a distance that is less than half the width of the work station. In this way, the output bins 205 are disposed conveniently near the operator at the seating area, so that the operator can readily remove processed documents from the output bins while the operator is at the seating area.

Details of the Stations

Feeding and Edge Cutting Stations

The feeding station 30 includes an input bin and a feeder. The input bin is configured to receive a stack of mail and convey it to the feeder. The feeder comprises a pivoting arm with a suction cup that grasps an envelope from the stack of mail and transports the piece to a side cutting station. In this way, the feeder 35 serially feeds mail from the stack of mail.

The side cutting station includes a plurality of drive rollers and opposing idler rollers. As the envelope passes between the rollers a rotary knife severs the side edge of the envelope. The severed edge drops down a scrap chute into a waste container.

From the side cutting station, the envelopes are top edge-justified so that the top edge remains at a consistent height. The envelopes may be justified by a pair of rollers to drive the envelopes upwardly against a stop at a predetermined height. However, such a roller justifier is typically limited to justifying envelopes that are similar in height. If there is too much variation among the envelopes in a batch of mail the justifier may not be able to properly justify the envelopes. For instance, if an envelope in a batch is unusually high, the top edge of the envelope may be positioned too high as it enters the justifier so that it causes a jam. If the envelope is unusually low, the top edge of the envelope may not engage the justifier rollers so that the envelope is not justified.

Accordingly, in order to accommodate a variety of envelopes, preferably the apparatus includes a shuttle that moves up and down to position the top edge of each envelope at approximately the proper height. The envelopes then enter the top-edge justifier to justify the top edge of the envelopes. The shuttle is a bin that receives each envelope and moves up or down as necessary to adjust the height of the top edge of each envelope as necessary depending upon the height of each envelope.

After the envelopes are top edge-justified, the envelopes are conveyed to a top cutting station that severs the top edge of the envelopes. In this way, the top and leading edge of each envelope is cut by the two cutting stations. Optionally, the side cutting station can be configured so that both sides of each envelope is severed. Yet another option is to eliminate or disable the side cutters so that only the top edge of the envelopes is opened.

Extraction Station

The extraction station 70 operates to pull apart the faces of the edge-severed envelopes and present the contents so that an operator can easily remove the documents. After the operator removes the contents, a sensor sends a signal to the controller that the contents have been extracted. The empty envelope is then transported to the verification station 90 and another envelope is fed to the extraction station 70.

Referring now to FIG. 9, the extraction station 70 includes a pair of opposing vacuum suction cups 74 mounted on two pivotal extractor arms 72. The suction cups 74 are connected to a vacuum pump. In FIG. 9, the extractor arms 72 are shown in two alternative positions. In the first position, the extractor arms are pivoted away from one another. In the second position the extractor arms are pivoted toward one another.

As shown in FIG. 1 the extraction station 70 is positioned in front of the seating area 15 intermediate the front and rear edges of the work station. Before an envelope enters the extraction station, the extractor arms 72 are pivoted away from one another. When the envelope enters the extractor, the arms 72 pivot toward one another and negative pressure is supplied to the suction cups so that the suction cups engage the faces of the envelope. The arms then pivot away from one another pulling apart the faces of the envelope, which have been severed along the top edge and preferably the side edge (see FIG. 9). The operator can then remove the contents of the envelope.

The document transport pinches the envelope between idler rollers and a belt. Therefore, when the extractor arms 72 pull apart the faces of the envelope, the envelope and its contents remain pinched between the idler rollers and the belt. To remove the contents, the operator must pull the contents with enough force to overcome the friction between the envelope and the contents caused by the pinching action of the extraction transport. In addition, this friction is maintained until the bottom edge of the contents is pulled past the pinch point. The extraction station 70 is preferably configured to operate in three different modes for determining whether the contents have been extracted from the envelope: removal mode, differential mode, and content activation mode.

The simplest mode is removal mode. An optical sensor 75 is located adjacent the extractor arms 72 in front of the extractor arms. When the operator removes the contents from the envelope the contents pass over the sensor 75 and the sensor detects the presence of the contents. A signal is then sent to the controller indicating that the contents were removed. The controller then controls the document transport to advance the envelope from the extraction station 70 to the verifier 90. In addition, an envelope is advanced to the extraction station. The envelope is advanced from the extraction station as long as some of the contents from the envelope are passed over the sensor 75, even if some of the contents remain in the envelope.

In the differential mode, an optical sensor 76 measures the thickness of the envelope immediately after the extractor arms pull apart the faces of the envelope so that the thickness of the envelope is measured before the operator extracts the contents. The optical sensor 76 continuously detects the thickness of the envelope and its contents, and compares the thickness with the initial thickness reading. If the difference in thickness is greater than a predetermined limit, a signal is sent to the controller indicating that the contents were removed from the envelope. The controller then advances the envelope to the verifier 90 and advances an envelope to the extractor. Preferably, the workstation includes a second optical sensor similar to the first sensor 76. The second sensor monitors the thickness of the envelope in the same way as the first sensor 76. When two sensors are employed, the measurements from the two sensors are averaged and compared against the predetermined limit to determine whether the contents were extracted.

If the operator removes all of the contents from the envelope, but the differential thickness is not greater than the predetermined limit, the envelope will not be advanced. In such instances the operator can advance the empty envelope by pressing an override button (not shown). Pressing the button operates to convey the empty envelope to the verifier 90 and convey an envelope to the extractor.

The content activation mode is like the differential mode in that the sensor 76 continuously detects the thickness of the envelope and its contents. However, in the content activation mode, the thickness detected by the sensor 76 is compared to a thickness standard based on the thickness of an envelope and a variation tolerance. If the sensor 76 detects a thickness that is less than the thickness standard, a signal is sent to the controller indicating that the contents were removed from the envelope. The envelope is then advanced to the verifier 90 and an envelope is conveyed to the extraction station 70. Preferably, two sensors 76 are employed, both of which monitor the thickness of the envelope as described above. When two sensors are employed, the measurements from the two sensors are averaged and the average is compared against the thickness standard.

If the operator removes the contents from the envelope, but the thickness detected by the sensor is not below the thickness standard the envelope does not advance. In such instances, the operator can advance the empty envelope by pressing the override button. In response to pressing the button, the empty envelope is conveyed to the verifier 90 and an envelope is conveyed to the extraction station 70.

The thickness standard used in the content activation mode can be determined in several ways. For example, the thickness standard can be based on the first envelope in a job. However, preferably, the thickness standard is calculated based on the average thickness of the envelopes processed in a job. In other words, the thickness standard is continuously updated based on an average of the thickness of the empty envelopes processed in a job. The thickness standard is then calculated based on the thickness of the empty envelope(s) and a predetermined variation tolerance.

Verification Station

The verification station 90 checks the thickness of each envelope to ensure that all of the contents have been removed from the envelope before the envelope is discarded into the waste container 25. The verifier 90 can use an optical sensor to check the thickness of the envelope, similar to the optical sensor or sensors used by the extraction station 70. However, the verifier preferably checks the thickness of the envelope by measuring the distance between the outer surfaces of the envelope faces. To measure this distance, the verifier 90 includes a rotary variable inductive transducer (RVIT).

To determine a zero thickness reference value, an empty envelope is fed to the verifier, and the sensor takes a measurement corresponding to the thickness of the empty envelope. The status of subsequent envelopes are determined based on the zero thickness reference value.

Alternatively, the reference value used by the verifier 90 to check the envelopes is calculated based on the average thickness of the previous envelopes and a variation tolerance, similar to the method described above for determining a thickness standard for the extraction step in the content activation mode. However, in the present instance, the calculation of the reference value differs from the calculation of the standard used in the extraction step. When calculating the reference value for the verifier 90, if the measured thickness of an empty envelope is greater than the current reference value, the thickness measurement for the envelope is not factored into the running average. For example, when calculating the thickness reference for the 100th envelope in a job, if the thickness of the 90th empty envelope was thicker than the reference value based on the previous sixteen envelopes, the thickness of the 90th envelope would not be included in the average used to calculate the reference value for the 100th envelope.

If the verifier 90 measures a thickness that is greater than the reference value, then a signal is sent to the controller indicating that the envelope in the verifier 90 is not empty. An indicator light (not shown) is lit indicating to the operator that the envelope at the verifier should be removed and checked to ensure that all of the contents were removed. A verifier sensor adjacent the RVIT sensor detects the presence of the envelope in the verifier 90. Until the operator removes the envelope from the verifier, the document transport will not advance any envelopes, regardless of whether the envelope in the extraction station 70 is empty.

If the verifier 90 detects a thickness that is less than the reference value, a signal is sent to the controller indicating that the envelope at the verifier is empty. The controller then activates the document transport to advance the envelope out of the extractor and into a trash chute that discards the envelope into the waste container beneath the verifier 90.

The operation of the feeding station 30, side and top cutting stations 40, 50 and extraction station 70 are similar to the operation of the apparatus described in U.S. Pat. No. 6,230,471, which is owned by Opex Corporation, who is also the assignee of the present patent application. U.S. Pat. No. 6,230,471 is hereby incorporated herein by reference. In addition, alternative feeding and cutting stations could be incorporated into the present apparatus. For instance, pending U.S. patent application Ser. No. 10/348,358 describes a mail processing apparatus having an alternative feeding station and edge cutting area that could be incorporated into the present apparatus in place of the feeding and edge cutting stations described above. Application Ser. No. 10/348,358 is hereby incorporated herein by reference.

The following description discusses the processing and imaging of documents that have been extracted from opened envelopes in the manner discussed above. However, in certain applications, the apparatus is operable to process documents without using the extraction features of the apparatus. For instance, the apparatus may be used to process a batch of documents that have been previously extracted, such as documents that are rejected by high speed automated processing devices. For such documents it is advantageous to use the feeding and scanning features as discussed below. Similarly, a batch of pre-slit mail may be processed, whereby the operator manually opens the slit envelopes and then processes the documents as discussed further below. Accordingly, unless otherwise noted below, the following discussion of the document imaging process is applicable to a variety of applications in which a batch of documents needs to be imaged, without regard to how the documents are obtained (i.e. the documents are provided in a stack as opposed to documents that must be extracted from envelopes). Features of the present invention are not limited to applications in which the envelope opening and extraction features of the apparatus are used.

Drop Conveyor

Referring to FIGS. 1-8 and 15, the drop conveyor 100 is configured to receive documents extracted from the envelopes. The conveyor 100 is disposed along the front edge of the work station 10, such that the conveyor is operable to convey documents adjacent to and parallel to the front edge of the work station. In addition, the conveyor preferably conveys the dropped documents toward the left hand side of the workstation. More specifically, preferably the drop conveyor is located so it has a drop zone centered on the seating area, and aligned with the extraction station.

Preferably the conveyor 100 is configured to readily receive documents that the operator manually removes from an envelope at the extractor. More specifically, the conveyor is configured to receive documents that are simply dropped onto the conveyor and to convey the dropped documents to the imaging station 130. In this way, the operator can readily extract and, if necessary, unfold documents and simply drop a document or packet of documents onto the conveyor with minimal preprocessing of the documents to prepare the documents for scanning.

Although the operator preferably drops the documents onto the drop zone of the conveyor, the drop zone is a substantial area that is much larger than the documents. Accordingly, the operator does not need to be precise with the location and orientation that the document are dropped onto on the conveyor. However, preferably the operator drops the documents so that the documents are front face up on the conveyor. Nonetheless, since preferably both sides of the documents are scanned at the imaging station, if the documents are front face down, the apparatus can analyze the documents to determine which face is the front face.

To this end, preferably the conveyor 100 is a roller bed conveyor. The bed of rollers provides a generally horizontal surface onto which documents can be dropped. The roller bed comprises a plurality of horizontally disposed cylindrical rollers 102 driven by a belt 103 engaging the bottom of the rollers, which in turn is driven by a motor controlled by the system controller. The rollers 102 may be parallel to each other and perpendicular to the direction of travel so that the documents move straight along the roller bed 100. However, preferably, the rollers are skewed so that the rollers drive the documents forwardly along the roller bed and laterally toward a justification rail 105. In this way, the skewed rollers 102 drive the documents against the rail 105 to edge-align or justify an edge of the documents against the rail.

Each of the rollers 102 comprise a plurality of grooves sized to receive O-rings. The O-rings have a higher coefficient of friction than the surface of the rollers, to provide an area of increased friction between the roller bed and the documents, thereby improving the justification of the documents. As mentioned previously, the document rests on the rollers. Therefore, as the rollers 102 rotate, the rollers move the documents forwardly.

If a single document or a packet of folded documents is placed onto the conveyor, the creases in the documents may tend to make portions of the documents project upwardly or downwardly, which could cause problems feeding the documents as they enter the imaging station. Accordingly, preferably the justification rail 105 is configured as shown in FIG. 4. Specifically, preferably the rail 105 is formed of a low friction material, such as smooth aluminum or low friction plastic, and is formed into a generally U-shaped channel. In this way, the rail forms an upper lip 106 and a lower lip 107. The upper lip 106 prevents documents from sliding up and over the rail as the documents are displaced forwardly and toward the rail. The lower lip prevents documents from being forced downwardly between the rail and the edge of the rollers as the documents are displaced forwardly and toward the rail.

Although, the drop conveyor 100 has been described as a roller bed conveyor, alternative types of conveyors can be utilized as the drop conveyor. For instance, the drop conveyor may comprise a horizontal conveyor belt. If a conveyor belt is used, preferably the belt is skewed toward the rail 105 so that the belt justifies the documents against the rail. Alternatively, rather than a single conveyor belt, the drop conveyor may comprises a plurality of smaller conveyor belts onto which the documents may be dropped.

Another alternative drop conveyor comprises a plurality of wheels that project upwardly from a base. In such a configuration, the conveyor includes a sufficient number of conveyor wheels that are spaced apart from one another so that the tops of the wheels form a bearing surface to support the documents. In this way, the documents are dropped onto the wheels, and the wheels are arranged so that when the wheels turn, the wheels drive the documents forwardly. Additionally, preferably the wheels are angled toward the rail 105 so that the wheels justify the documents against the rail as the wheels drive the documents forwardly.

Although the conveyor 100 is referred to as a horizontal conveyor, preferably the drop conveyor is angled downwardly so that gravity urges the documents toward the guide rail 105. This can be seen most clearly in FIG. 4. Preferably the conveyor 100 is angled at approximately five degrees, however, the angle may be higher, and in fact, the angle of the conveyor may be increased to a point that the conveyor is vertical rather than horizontal. In addition, preferably the imaging station and sorting station are angled downwardly similarly to the drop conveyor.

Image Entry Feeder

Referring to FIGS. 1-8, 15 and 16, from the drop conveyor 100, the documents enter an image entry feeder 110 at the end of the conveyor. As the documents are conveyed to the imager, the documents are generally horizontally disposed, riding on top of the drop conveyor 100 and are edge-aligned against the justification rail 105.

On the conveyor, the documents are only held down by gravity, and the upper lip of the justification rail depending on how the document is creased. Since the documents may be creased, gravity is generally not sufficient to flatten down the documents so that the documents can be fed to the imaging station. Therefore, the image entry feeder 110 preferably provides at least two function: (1) positively engaging the documents to pull or push the documents downwardly; and (2) controlling the feeding of the documents to the imaging station.

As shown in FIG. 6-7, preferably, the entry feeder comprises one or more parallel feedbelts disposed substantially horizontal and parallel to the base plate of the feeder. Although the entry feeder 110 may comprise a single wide belt, preferably the entry feeder comprises a plurality of parallel belts that are progressively longer so that the documents are progressively nipped over a wider area as the documents enter the feeder, as further discussed below.

Preferably, the feeder belt comprises a first belt 112 that operates as the lead feed belt. The lead belt may be longer than the other belts in the feeder so that the lead belt projects outwardly toward the drop conveyor further than the remaining feeder belts. In addition, preferably, the lead belt is positioned adjacent the rail 105 so that the lead belt is closest to the rail.

An idler roller 113 below the base plate is biased upwardly into engagement with the end of the lead feeder belt 112 to form an entry nip. This nip is the initial nip that serves to control the documents as they enter the feeder 110. Since the nip is close to the rail 105, the initial control point is adjacent the rail, which is the justification edge for the documents. The lead belt 112 pulls the documents forwardly, so that the documents engage the secondary feed belts 116. At this point, the documents are engaged by both the lead belt and the secondary belts, so that the area of engagement with the documents, (and therefore control over the document) is wider.

In addition, preferably the entry feeder further comprises an arm 114 that projects forwardly and upwardly from the leading end of the lead feed belt 112, parallel to the lead belt. The forward arm 114 comprises a rotatable belt that forms an acute angle relative to the drop conveyor 100. The arm operates to push down any documents that may have portions sticking upwardly, as discussed below.

Referring to FIGS. 2, 6-7 a folded document is illustrated on the drop conveyor. The document is exemplary of a document that was folded into three sections (i.e. tri-folded) and placed in an envelope. When the document was extracted from the envelope and unfolded, the document retained creases that cause the leading and trailing edges of the document to bend upwardly (or downwardly if the documents are flipped over relative to FIG. 2, 6-7). Similarly, the document could be a tri-folded document in which the crease lines are parallel to the direction of flow of the conveyor, rather than perpendicular as shown in FIGS. 2,6-7.

A document having a leading edge that is creased so that the leading edge is folded upwardly will tend to fold over as it is fed into the feed nip between the lead feed belt 112 and the idler roller 113. If the document folds over, it will not image properly. The operator could flip the document over so that the leading edge is not folded up, which could limit the potential problems associated with folding the document. However, it is desirable to configure the entry feeder so that it can accommodate the documents regardless of the orientation of the documents and the folds in the documents.

Accordingly, preferably, the entry feeder 110 includes the angled feeder 114, which rotates clockwise from the perspective of FIG. 6. The angled feeder arm, or knock-down arm is a feed belt that operates to drive the documents downwardly if the documents are creased and sticking upwardly. The position of the angled feeder arm tends to push the folded portion of the document down to flatten the document without folding over the document. More specifically, the knock-down feeder projects forwardly and upwardly from the horizontal lead feed belt 112 so that the forward edge of the knock-down belt is disposed higher than the lead belt, and angled downwardly to where the knockdown arm 114 is about the same distance from the drop conveyor 100 as the lead feed belt 112. In this way, the leading edge of the knock-down feeder arm 114 extends upwardly above the height of most all folded documents, so that the feeder flattens the documents without folding the documents over. At the same time, the drop conveyor and the knock-down feeder arm 114 urge the documents toward the lead feed belt.

Once a document is nipped between the lead arm 112 and the idler roller 113, the feeder drives the document forwardly. The shorter feed belts 114 then engage the document, so that the document is nipped (i.e. controlled) over a wider portion of the document. Additionally, the portion of the document nipped by the feed belts 112, 114 is held down flat. In other words, as the document enters the feeder, the document is progressively flattened.

The nip formed under the short feed belts 114 is formed between the feed belts and a retard 120. The retard is formed of a high friction material so that the frictional force between a document and the retard is greater than the frictional force between two documents. The retard can be formed in any of a number of configurations. However, in the present instance, preferably the retard comprises a pad having a plurality of spaced apart ridges 121 that are disposed between the belts that form the feeder, as shown in FIG. 5. In this way, the documents engage the retard between the belts.

The entry feeder 110 operates to feed the documents to the imaging station through a feed slot that is formed between the base plate 108 and an entry guide 132. Preferably the entry guide 132 is spaced from the base plate approximately ¼″. Since the documents are nipped by the feeder, the portion of the document at or adjacent the feeder will readily fit into the feed slot. However, the end of the document remote from the feeder may still stick up from creases or folds. For this reason, the leading edge of the guide is angled rearwardly (i.e. in the direction of the paper path, as shown in FIG. 7. In addition, preferably the leading edge of the entry guide is curved upwardly to provide a guide for forcing the leading edge of the document downward if the document has a portion sticking upwardly from folds or creases.

The foregoing discussion described the feeding of a single piece from the conveyor to the feed slot down stream from the feeder 110. However, the feeder is also operable to receive and separate packets of documents so that the documents are serially fed to the imaging station.

In order to accommodate packets of documents, preferably the lead feed belt 112 is spaced from base plate approximately 3/16″. When feeding the documents from the packet, the bottom-most document in the pack may tend to buckle in the gap between the feed belt 112 and the base plate. Specifically, the packet of documents is nipped between the retard 120 and the feeder 110, with the trailing end of the documents nipped between the lead belt 112 and the idler roller 113. When the second to last document is being fed through the feeder, the trailing edge of the second to last document is advanced out of the nip between the idler roller 113 and the lead belt 112 so that the last document in the stack is engaged by both the feed belt and the idler roller so that the nip drives the rearward edge of the last document forwardly. However, a portion of the second to last document is still nipped between the feeder and the retard, so that the last document is held back by the retard while the trailing end of the last document is driven forward, which can cause the last document to buckle between the baseplate and the bottom of the lead belt 112. Accordingly, preferably, an elongated thin flexible plastic finger 117 extends through the lead nip to support the document in the nip. Preferably the finger 117 is long enough to extend to the retard 120. By supporting the document, the finger prevents the document from buckling from the forward feed force when the retard is holding the piece back.

When a packet of two or more documents is received in the conveyor, the feeder is designed to control the documents so that they progress into the imaging station one at the time. If more than one document is fed at the time then one or more of the documents will not be imaged. The feeding of more than one document at the time is commonly referred to as a double feed.

A double feed is caused when two documents enter the imager together because the frictional force between the two documents tends to pull the second document along with the first document when the first document is fed to the imager. The retard 120 is provided to hold back the second document (as well as any other documents that may be in the packet), so that only the first document in the packet is fed to the imager.

Although it is typically desirable to utilize the retard so that packets of documents can be separated from one another, in certain instances it may be desirable to feed the documents without the retard. Accordingly, the device includes a pivotable retard control knob 119 that raises and lowers the retard. In a first position (which is shown in FIG. 5), the retard is raised so that the entry nip is formed between the feeder and the retard. By turning the control knob 119, the retard is lowered so that the top surface of the retard remains below the surface of the base of the imaging section so that the retard does not project up into contact with the documents. Instead, the entry nip is formed between the feeder and the base plate of the feeder.

Imaging Station

From the entry feeder, the documents serially enter a nip formed between a pair of crusher rollers 140. Although the entry feeder holds the documents down, it does not flatten the documents; it generally just holds an edge of the document flat against the base plate of the feeder. In contrast, the crusher attempts to flatten the creased documents.

The crusher rollers 140 are elongated cylindrical aluminum rollers 142 having a smooth surface. A plurality of elastomeric gripping rings 144 are formed around the circumference of the roller 142, and spaced apart from one another. Preferably, a first gripping ring is positioned at the end of the roller 144 closest to the entry feeder 110, and a second gripping ring is positioned on the roller a couple inches away. More specifically, preferably the second gripping ring is spaced inwardly less than the width of the feeder 110. In addition, preferably a third gripping ring is positioned adjacent the opposite end of the roller. The first and second gripping rings 144 provide nips that drive the paper from the entry feeder to the imager 150. The third gripping rings are positioned so that they are not in the paper path (i.e. the third gripping rings do not engage the documents. Instead, the third gripping rings provide spacing to maintain the rollers parallel with a constant gap.

Preferably, the first two gripping rings 144 on the rollers 142 are positioned so that both rollers engage a single fold for documents that are tri-folded with the fold lines disposed parallel to the paper path (i.e. the fold line would be perpendicular to the fold line on the document 8 in FIG. 2). In this way, the gripping rings engage the edge-justified third of the tri-folded document, while the rest of the document can slide across the width of the crusher roller since the remaining width of the crusher roller in the paper path is aluminum. In this way, the crusher roller flattens the documents without buckling the documents.

As the documents are processed through the imaging station 130 and through the imaging transport 170 and the sorting station 200, the documents should be properly spaced apart to prevent jammings to ensure a properly scanned image of each document, and to allow proper gating of the documents during subsequent processing. If the operator separates each document in a transaction and drops the documents serially onto the drop conveyor, the operator has provided the gap between the documents. However, if the operator drops a packet of documents onto the conveyor, the documents need to be separated in a way that provides proper gaps.

The gaps can be provided in one of at least two manners. First, the imaging entry feeder 110 can control the feeding of the documents to provide gaps. For instance, after the feeder 110 feeds the first document from the packet, the feeder may stop or slow down for a pre-determined time period before feeding the next document in the packet. This delay will create a gap between the documents. Alternatively, the gaps may be provided by running the feeder 110 at a slower speed then the transport path through the imaging station. This difference in speed will create a gap between the documents.

As can be seen in FIG. 2, a plurality of feeder exit sensors 135 are disposed in the feeder between the entry feeder 110 and the crusher roller 140. The feeder exit sensors detect the leading edge of each document to determine whether the document is skewed relative to the crusher roller (which is parallel to the imager). The feeder exit sensors 135 are preferably a plurality of optical sensors aligned with one another along a line that is parallel with the imager 150. Further, preferably, the first sensor is positioned adjacent the front edge of the work station, and preferably the second feeder exit sensor is spaced apart from the first feeder exit sensor a short distance so that the feeder exit sensors can detect the skew of relatively narrow documents. Further, a third sensor is aligned with the first two sensors and spaced further away (i.e. preferably at least 3-6 inches) away from the second sensor. The central controller receives signals from the feeder exit sensors. If the documents are slightly skewed, the documents may continue through for further processing. However, if the document is overly skewed, the central controller electronically tags the document for rejection so that the document is transported to a rejection bin without being imaged. Alternatively, if the document is extremely skewed, it may not pass through the imaging station and/or sorting station without jamming. Accordingly, the central controller stops the operation of the apparatus and provides an indicator (such as a warning light or a notice on the control screen) that the document needs to be removed from the imaging station.

After passing the feeder exit sensors 135 and the crusher roller 140, the document passes through a thickness detector 147 that measures the document at a plurality of points along the length of the document. The thickness detector may be any of a variety of sensors, such as an LVDT sensor. However, preferably the thickness sensor is a Hall effect sensor.

The Hall effect sensor 147 includes a sensor board disposed adjacent a magnet that is mounted on a linkage that biases the magnet toward the sensor. The magnetic field created by the magnet is measured by the sensor board as a function of the distance between the magnet and sensor. The magnet and sensor are linked to a pair of rollers between which documents are pinched when the document enter the thickness detector 147. When an envelope enters the thickness detector 147, the arms are forced apart, thereby separating the magnet and the sensor board accordingly, changing the magnetic field intensity. The thickness data is communicated with the system computer and is stored in a data file associated with the document.

From the thickness detector 147, the document enters the imager 150. Preferably the imager comprises a pair of scanners for scanning both sides of the document. Specifically, preferably the imager 150 includes a lower plate in which the lower scanner 150 is located, and an upper plate in which the upper scanner is located. The lower scanner 150 scans the bottom face of the document, and the upper scanner scans the upper face of the document. As shown in FIG. 8 preferably the upper plate is pivotable upwardly away from the lower plate to allow access into the imaging station 130 in the event of a jam in the imaging station.

Although the scanners may be black and white or gray scale, preferably, the scanners 150 are color scanners. More specifically, preferably the scanners 150 are contact image sensor (CIS) modules formed of arrays of photodiodes that operate as scanning elements, and LED light sources.

The scanner 150 scans the documents in three light colors, preferably red, green and blue. The scanner scans the documents and acquires data representing the light intensity for each of the three colors at discrete points of each document. For each color, the light intensity of each point, or pixel, is represented by a number ranging from zero to 255. The light intensity for each pixel is communicated to the computer and is stored in a data file. In addition, rather than being scanned in color, the documents can be scanned in gray scale.

As the document passes between the scanners, the scanners scan the faces of the document to obtain image data representing a color image of the document faces. The image is communicated with the system computer and the image data is stored in a data file associated with the document.

From the scanner, the document is conveyed to a MICR detector, which attempts to read any MICR markings on the document. Specifically, MICR markings are printed in magnetizable ink. The MICR detector includes a magnet that exposes the document to a magnetic field. The MICR detector also includes a MICR reader that scans the document for magnetic fluctuations indicative of MICR characters. If the apparatus detects the presence of a MICR line, the MICR detector attempts to read the MICR line. The data representing the MICR information is then communicated with the system computer, which stores the MICR data in a data file associated with the document.

Acquisition and Processing of Image Data

When the image entry sensor detects the presence of a document, the image entry sensor sends a signal to the system controller indicating the presence of a document. The system controller then sends a signal to the scanners regarding the document and data about how the document is to be processed. For instance, the system controller sends a signal to the scanning station indicating what type of document is being scanned and whether the default scanning parameters are to be modified for the document (e.g. the document is to be scanned on a darker setting).

The scanners scan the document to acquire image data. The system computer receives the image data from the scanners and then stores the data in memory. Once the image data is transferred to the system computer, the image data is processed. Although the scanner preferably scans the documents in color, preferably the image data is processed to provide a gray scale image that can also be binarized, as discussed further below.

First, preferably the gray scale data is binarized to create a black and white representation of the document image. By binarizing the data, the data for each pixel is converted from an eight bit gray scale representation to a one bit black or white representation, which significantly reduces the space that is required to store the image data. In addition, binarizing the image data operates to highlight the textual portions of the image, which is advantageous for further processing of the image data.

To binarize an image, the gray scale data for each pixel of the image is compared with a threshold. If the gray scale number for a pixel is above the threshold, the gray scale is converted to white. Conversely, if the gray scale number is below the threshold, the gray scale is converted to black.

To account for variations among the different documents being processed, an adaptive threshold may be used so that the binarization threshold is variable for each document. An adaptive threshold uses a different threshold for each particular document based upon a sampling of the image data for the particular document. By using a threshold that is specific to each particular document, the threshold for a document that has a generally dark image will typically be different than the threshold for a document having a generally light image. In this way, the resulting binarized image will more closely reflect the image of the document than if a set threshold is used for all of the documents.

In addition to binarizing the image data, the image data is filtered to reduce noise, which eliminates unnecessary background and stray marks from the document image. The noise filtering can be performed either in series or in parallel with the binarization. To filter the data, the imaging computer examines the image data to detect any black pixel that is surrounded by white pixels. If a black pixel is surrounded by white pixels, the imaging computer converts the pixel from black to white. Similarly, if a pair of adjoining black pixels are surrounded by white pixels, both pixels are converted from black to white.

After the image data is binarized and filtered, the imaging computer attempts to read information from the document. For instance, the imaging computer may attempt to read an OCR line if the document is an invoice or the MICR line if the document is a check. The OCR line data is helpful for later processing because the OCR line for documents, such as invoices, includes information about the customer's account and the amount of the invoice. During subsequent processing, it is helpful to know the customer account number. In addition, it is helpful to have other information about the document to process the document. For instance, it is helpful to know the invoice amount.

Based on data received from the system controller, the imaging computer knows what type of document is being scanned. Specifically, as discussed further below, the document type may be automatically determined for each document based on information that is acquired for the document during processing (e.g. document length and thickness). In addition, as discussed further below, the operator may manually identify the document type by selecting the appropriate document type when the document is dropped onto the conveyor. If the image represents a payment stub, the imaging computer processes the image data for the document in order to determine the document's OCR line, which typically appears at the bottom of payment stubs. The OCR line is a series of characters printed in a uniform predefined typeface of predefined size. Commonly, the typeface is a type referred to as OCR A, however, other types of typeface can be read, such as OCR B, E13B and others.

In addition, the imaging computer can function to process the image data to read the MICR line of checks so that the MICR line is read both optically and magnetically by the MICR character reader. As previously described, the MICR character reader magnetically reads the MICR line on checks. However, the MICR character reader may be unable to read one or more characters in a MICR line because of imperfections in the magnetic characteristics of the MICR line ink. These magnetic imperfections, however, may not affect the imaging computer's ability to read the MICR line from the optical image data, so that a character that cannot be read magnetically may be readable optically. Therefore, if the MICR character reader is unable to read a character in a MICR line, the data obtained optically is used to supplement the data obtained from the MICR character reader in an attempt to complete the MICR line data.

Alternatively, it may be desirable to use the imaging computer to verify the results from the MICR character reader. By verifying the results, the possibility of checks being processed with improper MICR data is reduced. For this purpose, the MICR line data obtained optically can be compared with the MICR line data from the MICR character reader. If there is any mismatch between the optically read MICR line and the results from the MICR character reader the imaging computer indicates that the MICR line was not determined. The system controller then tags the document as having an undetermined MICR line and the document along with the remaining documents in the same transaction are directed to a reject bin or sorted by the stacker accordingly.

After the OCR line or MICR line data for a document is extracted from the image data, the image data for the document is compressed using a compression algorithm, such as Group 4 fax compression commonly referred to as “CCITT Group 4.” The compressed image is combined with other data for the document. For instance, the document type may be combined with the data representing either the MICR line or the OCR line, along with data from the system controller to form a data record for the document. The data from the system controller includes information from the envelope from which the particular document was extracted, such as a change of address indication, the presence of a postnet barcode, and the presence of a mark indicating a customer response. The data from the system controller may also include an indication of whether the MICR line and OCR line were completely determined during imaging. Accordingly, the data record for a document includes the image data (the color image data, the gray scale image data and/or the binarized data), the MICR or OCR line, an indication of whether the OCR or MICR line is complete, a barcode, and miscellaneous information obtained during the processing of the document, such as the length and/or thickness profile of the document, as well as customer response data in the form of a change of address, or a check mark in a response mark.

Alternatively, it is frequently desirable to store the color or gray scale images of the documents in the data records rather than the binarized and filtered images. To store the color or gray scale images, a copy of the color or gray scale image data for each document is binarized and filtered as described above so that the image computer can determine the MICR or OCR line, and other information from the image data. The color image data is then compressed using data compression, such as JPEG, and the image data is combined with other information to create a data record as previously described for the binarized and filtered images. The binarized and filtered image can then be discarded from the imaging computer's RAM.

In addition to the previously described functions, the apparatus is operable to modify the image in any of several ways. For instance, the image can be de-skewed, and the black border that frequently occurs around the scanned image can be removed during processing of the image. The image can also be rotated to correct the orientation of the image. More specifically, a document may be fed into the imaging station in an improper orientation, so that the orientation of the image does not reflect the correct orientation of the image. For instance, a document may be printed in what is commonly referred to as a landscape orientation, in which each line of printing extends across the long side of the page (i.e. the length), rather than across the short side of the page (i.e. the width). If the short side of the page is fed into the imaging station, the image may be displayed in a normal upright portrait orientation, commonly referred to as a portrait orientation. In such an instance, the printing on the image would appear to be rotated (e.g. rather than reading from left to right, the printing would read from top to bottom). By rotating the image 90° the image would appear correctly.

Whether an image needs to be rotated can be determined in one of several ways. For instance, the operator may intervene and input information indicating that the image needs to be rotated, and how much. Alternatively, the imaging station can automatically determine whether the image should be rotated. In one application, the imaging station can determine the correct image orientation based on the document type. As discussed further below, the document type can be determined automatically based on various characteristics of the document (e.g. length, width, thickness profile, magnetic character position). Based on the document type, the imaging station can automatically rotate the image as necessary. For instance, checks are printed in a landscape orientation. By determining the length, width and location of MICR characters on the document, the device may automatically determine that a document is a check. Since the document is a check, the imaging station may rotate the image data as necessary so that the image data appears in a landscape orientation.

As described above, the document type is determined automatically based on characteristics of the document. However, as described further below, the document type can be determined manually or semi-automatically. Regardless of how the document type is determined, the image data may be automatically rotated to correspond to the proper orientation for the document type. Further still, rather than being automatic, the image rotation may be semi-automatic based on the document type. For instance, based on the document type, the system may determine that a document should be rotated, but the operator may be prompted to confirm whether the document should be rotated. Depending on the response (or possibly the lack of a response) the image data may or may not be rotated.

As described above, the image is rotated based on document type. In certain instances, it may be desirable to prevent the image from being rotated based on the document type. Specifically, the system may be configured so that images of documents of a certain type are rotated, as necessary, based on an assumed configuration for the document type. If the operator recognizes that the image of a document should be oriented in a specific way, the operator may override the system to scan the document in a certain configuration. For example, if a document of type “A” is assumed to be landscape orientation, the system will automatically rotate the image to be viewed in a landscape orientation. However, if the operator recognizes that a document of type “A” is actually in portrait orientation, the operator may press a button overriding the rotation. In other words, the operator can manually force the system to recognize the document in a certain orientation, and rotate the document or refrain from rotating the document as appropriate.

The image data can also be electronically stamped to identify audit trail information, such as the batch number, piece number, document-type and/or the date the document was processed. To electronically tag the document images, the image data is modified so that the audit trail information is “printed” on the image file for the document. In this way, the image data will show audit trail information as if the information was printed on the document before the document was imaged.

The audit trail information may be stamped onto the document image in one of several ways. For instance, the information may be stamped onto each document in a predefined area. Alternatively, a border may be added around the image data and the information may be electronically stamped in the border to ensure that the stamped information does not overlap and thereby alter the image data. In yet another alternative, the audit trail information is stamped onto the document so that each pixel that makes up the audit trail info is the exact opposite of the actual image data for the pixels where the information is being stamped. For instance, if the image is a binary image, and the information is to be printed in an area that has both black and white pixels, every pixel of the audit trail that is to be stamped onto an area that is black will be stamped as white, while the pixels that are white will be stamped as black. Although the above discussion of printing the audit trail information describes situations in which the image is binarized, the apparatus is also operable to stamp portions of the document image that are color or gray scale. Specifically, the electronically stamping is performed so each pixel comprising the electronic stamping is the gray scale or color number that best corresponds to the opposite light intensity of the pixels that make up the area that is to be stamped over.

Preferably, the documents are sorted and maintained in groups referred to as batches, which are identified by a unique batch number. The image data for a batch of documents is organized and maintained into batch data record files referred to as batch files. The batch files are organized so that the organization of the images in a particular batch file directly corresponds to the organization of the documents in the batch.

A complete batch file includes a batch header and the data records for each of the documents in the corresponding batch. The batch header includes information that is common to all of the documents in the batch, such as the batch number, the date the documents were processed, and the number of documents in the batch. Once all of the records have been appended to a batch file, the batch file can be exported to an image file server in the form of a personal computer, PC, via an ethernet connection.

Difficulties may arise when imaging the documents. For example, the MICR module 220 and the imaging computer may not be able to read the MICR line for a check or the OCR line for a payment stub. If such a problem arises, the image, along with the data that was ascertained from the image may be exported to the respective batch file for the document and the document may be sorted with the other documents in the batch. In this way, the document and the image data for the document are available for subsequent processing, at which time the missing information can be manually read and entered into the data record for the document. Alternatively, the display may prompt the operator to examine the scanned image displayed on the screen and enter the missing information. Alternatively, if some information, such as the MICR line or OCR line is not fully read for a document, the document along with the other documents in the same transaction may be directed to a reject bin. If documents are directed to a reject bin, then the image computer discards the image data for the documents in the transaction.

Although the description has focused on the processing of checks and invoices, the apparatus is not limited to imaging checks and documents. More specifically, the apparatus is operable to scan and process a variety of documents that may include a variety of information that may be scanned and analyzed. For instance, the imaging station is configured to accommodate documents at least as large as 8½×14. In addition, the image data exported to the data file may include more data than just the image file for the entire document. For instance, portions of the documents can also be included in the data file. For instance, for a check, an enlarged view of the courtesy box on a check can be included in the data file. Further, the portion of an image in which a bar code is located can be separated out as a file and included in the data file. The portions of the scanned images that are attached to the data file is not limited to a single portion. Preferably, up to ten different image data sets can be included in the image data file. In this way, a variety of image portions can be stored in the data file together with the image data for the entire image, if desired. Additionally, a variety of information can be stored in the data file with the image data, such as the length and/or height of the document, the thickness profile, the MICR line etc.

In addition, in certain instances it may be desirable to feed a batch ticket before the batch of documents are fed. The batch ticket includes a variety of information specific to the batch of document. For instance, the batch ticket may identify the batch number and the date that the images were processed.

Imaging Transport

As shown in FIG. 8, the imaging transport 170 extends between the imaging station 130 and the sorting station 200. Preferably the imaging transport is formed of two halves, and the upper half 172 is pivotable away from the lower half 174 to provide access to the transport path to remove any paper jam in the transport, or perform service on the interior element.

As shown in FIG. 8, the document path between the imaging station 130 and the sorting station 200 is preferably not a straight horizontal path. Instead, preferably, the imaging transport 170 turns upwardly and curves backwardly toward the seating area 15. Specifically, the imaging transport 170 conveys the document away from the scanners 150 in a generally horizontal direction parallel to the base plate of the lower scanner plate. The transport 170 then curves upwardly, thereby turning the document upwardly along a generally vertical path. The transport 170 then curves back generally horizontally toward the seating area. In other words, the transport essentially provides a U-turn, turning from the imaging station back toward the operator. The reverse leg of the U-turn also extends somewhat upwardly at an angle relative to the scanner base plate. At the end of the reverse leg of the U-turn, the transport 170 curves back upwardly, thereby turning the document upwardly along a generally vertical path to the sorting station 200.

Between the imaging station 130 and the sorting station 200, preferably an optional uncreasing station and a printer are disposed along the transport path.

Referring to FIG. 10, a schematic view of the uncreaser 180 is illustrated. As discussed previously, it is desirable to remove the creases from folded documents so that the documents can be more efficiently stacked in the output bins 205. The uncreasing station 180 is a guide having a sharp edge that the documents pass over as the documents turn around the U-turn.

More specifically, the transport path includes a pair of guides that direct the documents upwardly. The guide on the outside is a generally smoothly curved guide 182. The guide 184 on the inside of the turn includes an edge. For example, comparing the outside guide to the inside guide, the outside guide is a smoothly curved metal track, whereas the inside track includes two generally flat or straight legs formed at an angle to one another. The intersection of the two straight tracks forms an edge that operates as a sharp edge.

Preferably the documents are pulled tightly against the sharp edge as the documents are conveyed across the sharp edge. To do so, preferably the documents are tensioned by increasing the transport speed on the downstream side of the sharp edge. Specifically, a nip is formed downstream from the sharp edge, and the transport speed from the nip forward is higher than the transport speed upstream from the uncreaser. In this way, when the document enters the nip, the higher speed of the transport pulls on the document, which tensions the document as it is pulled over the sharp edge. Pulling the documents over the sharp edge removes or reduces the creases in the documents.

Although the uncreasing device has been described as a guide having a sharp edge, various other types of uncreasing elements can be employed. For instance, the sharp edge can be mounted on a solenoid adjacent the document path through the transport 170. The solenoid can be controlled so that it displaces the sharp edge into the paper path so that the paper is pulled over the sharp edge when the solenoid is actuated. The solenoid can then be de-energized to retract the sharp edge out of the paper path on selected documents or on selected portions of documents (i.e. if only the second half of a document needed to be uncreased, the solenoid could be timed to be actuated to displace the sharp edge into the path when the second half of the document passed by). Yet another alternative uncreaser is a heater. The heater includes a pair of rollers forming a nip that the documents pass through. One of the rollers is heated to an elevated temperature sufficient to reduce creases in the documents.

Additionally, the apparatus may include a second uncreaser to uncrease the opposite side of the document. Specifically, the second uncreaser may be configured similarly to the first uncreaser except that the second uncreaser may be formed by the outside guide, rather than the inside guide as described above.

Although the uncreaser has been described as being located between the MICR reader and the printer, it may be desirable to locate the uncreaser in a different location. For example, the uncreaser may be located prior to either the scanners 150 or before the MICR reader. Further still, the uncreaser may be located downstream from the printer if desired.

In addition, preferably a printer is disposed along the transport 170 so that the printer can print markings on the documents as they are conveyed to the sorting station 200.

After the documents are imaged, the transport conveys the documents to a printer module 190 which is controlled by the imaging computer. The printer module 190 includes at least one ink jet printer. The printers 190 are disposed behind covers in the imaging transport. More specifically, a first printer is preferably disposed behind a plate in the upper portion 172 and preferably the second printer is disposed behind a plate in the lower portion 174. In response to signals from the computer, the printer module 190 prints audit trail data onto each document. The audit trail information printed on a document includes data particular to the document, such as the document type for each document, the batch number for the document, the document number, the transaction number for the transaction of which the document is a member, and the date on which the document was processed. The audit trail information can be used to subsequently locate a particular document within a stack of documents. Generally, it is desirable to only print information on the back face of each document. However, the printer module may also include a second ink jet printer to print additional information on the front face of the documents. If a second printer is used, the imaging computer controls the printers to selectively print on either or both sides of a document.

Sorting Station

The sorting station 200 is disposed at the end of the imaging transport 170, and the sorting station includes a plurality of gates 204 operable to sort the documents into one of a plurality of bins 205. Preferably, the documents are processed so that the documents are discharged into the bins 205 face down, so that the stack of documents in the output bins is in the proper order so that the stack does not need to be re-ordered after the documents are processed.

As shown in FIG. 8, the sorting station includes a plurality of gates that are operable to direct the documents to the appropriate bin 205. The sorting can be based on a number of criteria. For instance, the documents can be sorted according to information determined from the image data. One such example relates to forms that have a change of address check box (i.e. the consumer checks the box if the consumer's address changes). The image data can be analyzed to determine whether a change of address is indicated. If so, the sorter 200 can sort such documents to a particular bin 205 to keep such documents together. Similarly, the documents can be sorted based on other criteria, such as the thickness profile of the document, which can indicate what type of document the document is, or the documents can be sorted based on the length of the documents, which can be measured by one of the sensors at the entrance to the imaging station. In addition, the documents can be sorted based on a combination of the information that is determined for a document (i.e. thickness profile, length, MICR reading, OCR reading, barcode reading or an identifying mark). Further, documents that are electronically tagged for outsorting or rejection can be sorted to a particular bin.

In addition to identifying the pieces automatically, the operator may manually identify the document type before the document is scanned. The piece is then sorted based on the indicated document type. For instance, there are five bins in the sorting station, and there may be five or more different document types defined for a particular batch of documents. Each type of document is sorted to a different one of the bins 205, unless there are more document types than bins, in which instance, more than one document type will be sorted to some or all of the bins. Each document type is represented by a different button on the touch screen 12. When the operator places a document onto the conveyor 100, the operator presses the button that identifies the document type, and the document is sorted accordingly. Alternatively, when the operator pushes the button to identify the document type, the apparatus may assume that all subsequent documents are the same type of document until the operator presses a different button. In this way, the operator need not manually identify each document if a batch of documents contains groups of the same type of document.

Further still, the manual indication described above can be combined with the automatic determination described above. Specifically, the apparatus may automatically identify the document type unless the operator manually overrides the system by identifying the document type for a document when it is placed onto the conveyor.

In the discussion above, the documents were sorted according to various sort criteria based on information about each document. However, in some applications it may be desirable to simply sort the documents in the order in which the documents are processed. In such an application the documents are sorted to the first bin 205 until the bin is full. The documents are then directed to the second bin until it is full, and/or until either all of the bins are full or all of the documents are processed. This process of switching from one bin to the next when the bin is full is commonly referred to as waterfalling, and the bins are operable to waterfall in various sorting applications.

As discussed previously, the documents may have creases that frequently affect the processing of the documents. At the sorting station 200, the creases can affect the stacking of the documents in the bins. Specifically, if the documents are creased along lines perpendicular to the document path, the documents will tends to fold underneath themselves as they are discharged into the output bins 205. Accordingly, preferably the sorting station includes corrugation rollers to corrugate the documents so that the documents are rigid enough to discharge properly so that the documents stack properly in the bins.

Preferably, the corrugation rollers are configured as follows. The rollers are positioned on two parallel shafts, with one of the shafts being biased toward the other shaft. Opposing discharge rollers are located on the outer ends of the shafts. The discharge rollers provide discharge nips that the documents pass through. Intermediate the discharge roller are three corrugation rollers that are larger than the discharge rollers, preferably by at least approximately 25%. Two of the corrugation rollers are positioned on the first shaft, with the third roller being positioned on the second shaft at an axial location between the two corrugation rollers on the first shaft. In this way, the three corrugation roller project into the paper path creating a deep corrugation that provides sufficient rigidity to properly discharge and efficiently stack the documents.

Bypass

As discussed above, the documents leaving the scanners 152 are conveyed along the imaging transport, which extends upwardly to the output bins. Alternatively the sorting bins can be bypassed by directing the documents through a bypass that has an opening across from the imaging station and under the imaging transport 170. The documents can be conveyed horizontally from the imaging station 130 and discharged to an output bin or area 166 on the backside on the imaging station (see FIG. 3). The bypass can be used in several applications to improve the flexibility of the apparatus.

For instance, if a batch of documents simply needs to be imaged without sorting the documents after processing, then the sorting station is unnecessary. The documents can be dropped onto the conveyor 100 and then fed to the imaging station 130. From the imaging station the documents are conveyed straight back and discharged though the bypass 165 into the output area 166. In this way the scanned documents are simply stacked in a single stack in the output area.

The bypass also allows for thick documents to be scanned. Specifically, documents that are thicker than twice the thickness of paper may tend to jam in the imager transport after the documents are imaged. However, the documents will not typically jam if they are discharged out the bypass. Accordingly, thick documents can either be processed together so that all of the documents in the batch are scanned and then directed out the bypass 165, or optionally, the thick documents can be a defined document type that can be analyzed automatically based on the output from the thickness detector 147. If the thickness detector 147 detects a document that is greater than a predetermined limit, the document is directed to the bypass 165. Alternatively, the operator may be able to identify the document as a thick document when the document is dropped onto the conveyor 100. The document is then directed to the bypass 165 after being imaged. (There is also a maximum thickness for thick documents. If the thickness is greater than the maximum thickness, the document is too thick to process, even through the bypass.)

Yet another use of the bypass 165 relates to disposing of envelopes. As discussed further below, opened envelopes may serve as an indicator of the beginning or end of a transaction. Typically, it is not necessary to retain the document and it would be a waste of space in the sort bins 205 to fill them with the envelopes.

Accordingly, when using opened envelopes as transactional markers, preferably the envelopes are discarded through the bypass 165, while the documents are scanned and then sorted into the sorting bins 205. The envelope can be identified as an envelope in one of several ways. First, the operator can manually identify the envelope as an envelope. Second, and more preferably, the apparatus identifies the envelope as an envelope based on the thickness profile detected by the thickness sensor 147. If the thickness profile for a document correlates to an envelope, then the apparatus assumes that the item is an envelope and the envelope is directed to the bypass and discharged to the discharge area. For this reason, when processing envelopes with the documents, preferably a waste container is provided at the discharge area so that the envelopes can be conveyed directly into the trash.

In the previous discussion, it is assumed that it is not desired to keep the envelope after processing. In many instances, since the envelope is merely a transaction marker, there is no need to retain the image of the envelopes, so that the envelope images are also discarded. However, in certain instances it may be desirable to retain the envelope image even though the actual envelope is discarded.

In yet another application, the bypass 165 is used to accommodate envelopes that have contents inside. The function of this application is similar to the processing of thick documents described above. In such an application the envelopes are scanned to obtain an image of the front and rear faces of the envelope. The envelope is then discharged out the bypass 165, where the envelopes are stacked.

In still another application, the bypass is used to accommodate stacks of documents that are processed together so that only the top document is scanned. The stack of envelopes is then directed through the bypass. One application of this feature is useful in processing certain types of returned mail in which the recipient's address has changed and the post office will no longer forward the mail. Such mail may be returned with certain information about the recipient obscured (such as the recipient's account number) and the new address printed by the post office. Presumably, one of the documents in the envelope, such as the first document, will have the recipient's account number. Since the documents in the envelope are returned documents, there is probably no reason to image all of the documents. However, it would be desirable to image the envelope (showing the new address) and one of the documents (showing the customer account information). Accordingly, the operator may input information specifying that the documents are to be processed without separating the documents. The retard controller 119 is turned so that the retard is disengaged. The packet of documents is then placed on the conveyor 100, as well as the envelope. The packet of documents is then processed through the scanner together as if it is a single document, so that the top face of the top document is scanned. The packet of documents and the envelope are then directed out the bypass 165. Alternatively, if both addresses are viewable on the face of the envelope, the envelope can simply be imaged without removing any of the documents. The envelope and the enclosed contents are then directed out the bypass. The operator may select whether the envelope should be processed alone or with one of the enclosed documents. Before processing the envelope, the operator identifies how the envelope is to be processed, inputs the information into the system, such as by a button, keyboard, mouse or otherwise, and then drops the envelope (and document if appropriate) onto the conveyor.

Maintaining Transactional Integrity

All of the documents in an envelope are referred to as a transaction. In addition, in some applications the envelope may be considered part of the transaction. It is often important to ensure that documents from a first transaction do not get mixed with documents from a second transaction. This is referred to as maintaining transactional integrity.

Because each envelope defines the boundaries for each transaction, and the documents are initially contained within envelopes, the boundaries for each transaction are known. However, once the documents are removed from an envelope and are placed onto the drop conveyor it is possible for the documents from different transaction to become mixed. Accordingly, preferably steps are taken to maintain the transactional integrity after the documents are removed from the envelopes. In addition, in applications in which the documents are already extracted from the envelopes, it may require intervention from the operator to indicate the end of a transaction.

One method for maintaining the transactional boundaries is to manually indicate the beginning point of a transaction. For instance, the apparatus may include a push button or a foot pedal that the operator depresses to indicate the beginning or end of a transaction. Since the operator extracts the documents from the envelope, the operator knows which documents belong to a transaction. After the operator extracts the documents they are placed onto the drop conveyor one at the time. After the last document is placed on the conveyor, the operator indicates the end of the transaction by pressing a button (or otherwise). The next document will be processed as the first document in the next transaction.

A second method for maintaining transactional boundaries is to automatically define the transaction based on the document type. For instance, if the transactions include two types of documents, the operator places the documents onto the drop conveyor so that the second type of document is always the trailing document. Therefore, when the apparatus determines that a document is the second type of document, the apparatus assumes that the document is the last document in the transaction, and the next document processed is considered to be in a new transaction. When using this method, the document type can be determined based on different characteristics that have been determined for the documents. For instance, the apparatus can determine that a document is a check if it has a certain length and a MICR line. Therefore, if checks are the identifying document, the end of each transaction is automatically defined when the apparatus detects a document having the certain length and a MICR line.

One way to implement the second method is to use the envelope as the trailing document. An envelope is readily distinguishable from documents based on its thickness profile since envelopes generally have numerous folds that cause thickness differences along the length of the envelope. Although only the contents were imaged in the discussion above, the envelope can also be fed into the imaging station. Once the apparatus identifies a document as an envelope based on the thickness profile, the transaction is closed. If it is desired to keep the envelope, the envelope can be sorted, and the envelope image can be included in the images of the transactional boundaries. However, typically the envelope simply serves as a divider, so that the envelope is outsorted through the bypass (as discussed above), the envelope image is deleted and the document that preceded the envelope is considered to be the last document in the transaction. Alternatively, the envelope can be the leading document, so that the envelope marks the beginning of the transaction rather than the end.

In certain instances this second instance may not properly identify the transactional boundaries. For instance, using the above check example, if a transaction includes three documents and two of the documents are checks, the transaction will be split into two transactions. Accordingly, rather than automatically defining the transactional boundaries by document type, the boundaries my be defined by the transition from one type of document to another. In other words, the end of a transaction is defined by the point at which the documents transition from one type of document to another. If documents of the second type are supposed to be the trailing documents, then the apparatus assumes that when it identifies a transition from a document that is of the second type to a document that is of the first type, there is a transactional boundary. The document of the first type is assumed to be the first document in the next transaction.

Returning again to the check example, if checks are supposed to be the trailing document type, then all of the documents in a series of documents are considered to be in a single transaction until the apparatus identifies a transition from a check to a different type of document. The last check is included in the first transaction and the subsequent document is considered to be the first document in a second transaction.

A fourth method of maintaining transactional boundaries is a more automatic method that utilizes the extraction sensors. As discussed further above, the apparatus includes several sensors 75,76 that monitor the envelope at the extraction station to determine whether the contents have been extracted. Once the sensors indicate that the contents have been extracted, the envelope is automatically conveyed forwardly. This indication that the envelope is empty can be utilized to demark the transactional boundaries.

For instance, once the sensors indicate that an envelope is empty, the apparatus assumes that all of the documents that are received in the imaging sensor within a subsequent time frame are considered to be in the same transaction. Any documents after the time period are considered to be in the next transaction. For instance, the time period may be 2 seconds, so that after receiving an indication from the sensors that all of the documents are extracted from an envelope, all of the documents that are received in the imaging section in the next 2 seconds are considered to be the transaction that was extracted from the envelope.

In the second, third and fourth methods of defining the transaction, it was assumed that the transaction were determined automatically based on different characteristics. In addition, during any of these methods, the operator may manually override the determination by pressing a button or otherwise, as described in the first method.

Operator Interface

As discussed previously, the apparatus includes a touchscreen display to provide an input/output device for the operator to interface with the operation of the machine. The interface between the operator and the apparatus may be more clearly explained with reference to FIGS. 11-14, which are pictures of screen displays.

Referring first to FIG. 11, the display includes three general areas. The bottom portion of the screen provides a toolbar of control buttons. The operator can control different aspects of the operation of the apparatus with these buttons, as described further below. The second part of the display is the left side of the screen above the toolbar. This left hand portion displays thumbnail views of the recently scanned documents. The third portion of the display is the right side of the screen above the toolbar. This right hand portion is a large view of the last document to be scanned, unless a different document is selected by the operator, as discussed further below. In addition, under the third area a window is provided in which information is displayed, such as information regarding the batch of documents.

Referring to FIGS. 11-13, the thumbnail view section of the display can be displayed in one of three views. In the first view (FIG. 11), the thumbnails are organized by transaction. In other words, each row represents a transaction, so that a thumbnail of each document in a transaction is viewable in the same row. For instance, the transaction in the first row includes four documents and thumbnail views of each document are displayed in the first row. In contrast, the third transaction includes only two documents, as can be seen by the two thumbnail views in the display.

In the second view (FIG. 12), the thumbnail views are displayed according to the bins 205. The first row displays the thumbnail view for each document in the top bin, and the thumbnail views are in the same sequence that the documents are in the top bin. Similarly the second row displays the thumbnail views for the documents in the bin under the top bin, and so on through the bottom bin.

In the third view, the thumbnail views are simply displayed in the sequence in which the documents were processed. The first thumbnail in the first row represents the first document scanned in a batch, and the thumbnails continue in the first row until the end of the first row. The next document is then displayed in the leftmost position in the second row. The display of the thumbnail views continues in this pattern until all of the documents are displayed.

By touching the thumbnail view of any document, an enlarged view of the document is displayed in the right-hand viewer portion of the display. In addition, if a document is selected in one view (e.g. transaction view) and the view is changed (e.g. to bin view), the selected document is also selected in the changed view.

More specifically, if the display is in transaction view, a document in a transaction can be selected, as shown in FIG. 11 (the selected document is identified by a dark border surrounding the selected document). If the operator desires to retrieve the selected document, it would be desirable for the operator to know which bin the document is located in, and how far the document is from the top of the stack. Accordingly, after selecting the document in transaction view, the operator can switch to bin view by touching the “Switch” button on the button toolbar. The thumbnails will then be displayed according to bin, and the document that was selected in the transaction view will remain selected in the bin view so that the operator can see which bin the document is in and by counting the documents in the row that the selected document is in, the operator can determine how far the document is from the top or bottom of the bin.

Similarly, if a document is selected in bin or sequence mode, and the view is switched to a different mode, the selected document will remain highlighted after the view is switched.

The other buttons in the toolbar relate to other features of the device. For instance, as discussed above, during operation, the operator may identify the document type when the document is dropped onto the conveyor 100. Preferably, the five buttons in the center of the toolbar represent the five different types of documents being processed for a batch of documents. Accordingly, to identify the document type the operator can simply touch the appropriate button when dropping a document on the conveyor.

Further, as discussed above, the operator may manually identify the transactional boundaries. Accordingly, after the operator places the last document in a transaction onto the drop conveyor, the operator touches the “new transaction” button, thereby indicating that the next document is in a new transaction.

In addition, it may be desirable to include controls to allow the operator to specify special scanning criteria for a document. For instance, if the operator notices that a document is particularly light, the operator may want to indicate that the document should be scanned on a darker setting than usual. Similarly, the operator may want to change the image scanning from color to gray scale or vice versa, or the operator may want to change the scan resolution. To provide such control, a special handling button may be provided on the display. When the operator desires to specify some type of special handling, the operator touches the special handling button when the document is dropped onto the conveyor. A display of the different special processing options may then be displayed, thereby allowing the operator to control the scanning characteristics on a piece by piece basis if necessary.

Reject Processing

During processing, a document may be rejected for one of several reasons, such as too much skew or insufficient gap between documents for example. There are two primary modes of processing the rejected documents and the documents that were upstream from the document when it was rejected.

In the first reject processing mode, the exact sequence of all of the documents within a batch is not critical. In such a mode, the rejected piece is sorted to a reject bin or directed out the bypass 165, and the subsequent documents in the transaction are processed in accordance with the ordinary procedures discussed above. At the end of the batch, all of the rejected documents are retrieved and processed again.

In the second reject processing mode, the order of the documents either within a batch or within the transactions is important. Therefore the rejects and subsequent documents should be processed in a manner that allows the proper order of the pieces to be maintained. Accordingly, if a piece is rejected, all of the piece from the image entry feeder forward are rejected. The apparatus then prompts the operator to retrieve all of the rejected documents and re-feed the documents in the proper order to resume processing.

Editing/Re-Scanning

During processing, the operator may determine that a document needs to be re-scanned. For instance, the operator may notice on the display 20 that an image was too light. The operator selects the document by touching the thumbnail image. If the operator is unsure in which bin the document is located, the operator can press the switch button and switch to bin view, which will show the operator the bin that the document is located in, as discussed previously. The operator can then retrieve the document, and then select the re-scanning option from a list of editing functions. The document is then dropped onto the conveyor and is then scanned. The newly scanned image replaces the previous image.

Similarly, if the operator notices that a document needs to be added, such as being added into a transaction that was already processed, the operator selects the insert option from a list of editing functions. The document is then dropped onto the conveyor and is then scanned. The newly scanned image is inserted into the appropriate transaction and/or into the appropriate sequence in the batch. The operator can then replace the physical document into the proper document sequence. If the documents are sorted or stored according to document sequence number, then the document is inserted into the stack of documents according to the document sequence number (i.e. the order in which the document was originally processed; not the order in which it was re-scanned). In the alternative, the documents may be sorted and/or maintained according to the document scan number (i.e. the order in which the documents are scanned). If the documents are kept according to the document scan number, then the document should be in the proper order after it is scanned.

In certain instances the operator may determine that a document image should be deleted. To do so, the operator selects the thumbnail image of the document, and then selects the delete option from a list of editing functions, so that the image is discarded.

Yet another feature is the ability to move a document from one transaction to another or within the sequence in the batch. A document can be moved in one of two ways. First, the operator can select the document by touching the thumbnail image. The operator then selects the move option from a list of editing functions. The operator is then prompted to indicate where the document is to be moved to, such as which transaction and which document in the transaction. Alternatively, preferably the operator can move the document by drag and drop, by touching the thumbnail and dragging the thumbnail across the screen to the appropriate location.

Processing Documents with Detachable Portions

Overview

Frequently, documents include a portion that is designed to be detached. Most frequently, a perforation separates the detachable portion from the body of the document. The detachable portion of the document is commonly referred to as a skirt. In some instances, the skirt may include information that is helpful for further processing of the document. For instance, a check may have a skirt that identifies the invoice numbers that correspond to the payment. When processing a batch of documents, it may be desirable to process the checks with the skirt attached, rather than removing the skirt. Further, in certain instances, it may be desirable to selectively remove the skirt.

Accordingly, documents with skirts may be processed as follows. The operator treats the documents with skirts just as other documents: the documents are dropped onto the drop zone of the conveyor without detaching the skirt. The documents with the attached skirts are then imaged at the imaging station, and sorted into one of the sort bins. After the documents are removed from the output bin, the skirts may be removed if desired. Further still, if the documents with skirts have an identifying characteristic, then the image data can be scanned for the identifying characteristic. Documents having the characteristic can then be sorted to a particular bin. In this way, the skirts can be removed in groups rather than one at a time.

Since the skirts may be discarded, it may be desirable to discard the skirt portion of the image as well, thereby reducing the file size for the image data. Accordingly, if a document is identified as a document having a skirt, the image data may be parsed to only retain the portion of the data corresponding to the portion of the document without the skirt. Alternatively, the image data can be parsed so that the image of the skirt is in one file and the image of the remaining portion of the document is in a separate file.

The documents with skirts may also be sorted according to various characteristics on the documents. For instance, if the documents have a MICR or OCR line, the documents may be sorted according to information in the MICR or OCR line. In one application, the documents may be checks, and the MICR line may be read optically or by using a MICR reader, as discussed previously. The MICR line includes information that identifies the institution on which the check is drawn. Depending on the institution on which a check is drawn, it may be desirable to retain the skirt. Therefore, checks drawn on institutions for which the skirt should be retained are sorted to one bin, or one of several bins, while checks drawn on institutions for which the skirt should be discarded are sorted into different bins. In this way, the checks for which the skirt should be removed are separated from the checks for which the skirts should be detached and discarded. By separating the documents, the skirts can be detached in groups without removing skirts that should not be detached.

In addition to sorting the documents based on various characteristics, the image data may be parsed based on various characteristics. Depending on the account identified in the MICR line, the image data may be parsed so that only the image of the checks without the skirts is retained. The scanned image data for the skirts are deleted. Similarly, based on the account identified in the MICR line of a check, the image data may be parsed into two files: one file for the check image data, one file for the skirt image data.

Example

A common document having a detachable portion is a check with a detachable skirt, such as the check document 250 illustrated in FIG. 17. The check document 250 includes the check portion 251 and a skirt 252 (a second skirt 253 is shown in phantom, as discussed below). A perforated line 254 allows the skirt to be separated from the check. The check 251 is the negotiable instrument portion of the document that is processed by the financial institutions to make a payment from an account. The skirt includes information about the payment, but it is not part of the negotiable instrument.

A check with a skirt is a typical configuration for a business payment. The skirt will include information that is not necessary for the financial institution processing the check. Instead the information on the skirt is typically used to credit the payor's account. For instance, company A may send out several invoices to company B. In order to properly credit company B's account, any payment should include information regarding the invoices being paid. Rather than returning copies of the invoices with the check, the invoice information may be summarized on the skirt. This information is helpful to company A during remittance processing to ensure that the payment is properly credited to company B's account. However, since the skirt is not part of the negotiable instrument, the skirt is typically removed before the check is submitted to a financial institution. The process of manually removing the skirt portion after the documents are processed can be quite laborious, thereby reducing the efficiency for processing the documents. Therefore, it is desirable to reduce the manual labor required to remove the skirt portions of perforated documents or, if possible, to eliminate the need for removing the skirt manually.

One complication in improving the efficiency of processing documents having skirts is that there is no standard configuration for check/skirt documents. For instance, in FIG. 17 a two-panel document 250 is shown (with a third panel in phantom). The top part of the document is the check 251, the bottom part is the skirt 252, and a perforation 254 connects the two portions so that the skirt is more easily removable from the check. In FIG. 17 the check 251 is illustrated as being above the skirt 252, so that the skirt is attached to the bottom edge of the check. However, in another common configuration the skirt is attached to the top edge of the check. In addition, in some instances the document may be a tri-fold document having two skirt portions. FIG. 17 illustrates such a tri-fold document, with the second skirt 253 shown in phantom. For such a check document, the check may be the top third, middle third or bottom third of the document. However, typically the check is the top third or bottom third of the document.

An additional complication arises when processing the bi-fold check document shown in FIG. 17 because the document is generally square rather than being elongated. Therefore, the size of the document does not necessarily provide an indication of the orientation of the document. The check could be oriented in either an upright orientation or in a rotated orientation in which the check is disposed vertically. Accordingly, the check could be in any of four orientations: upright, rotated 90°, rotated 180° (i.e. inverted), or rotated 270°. Additionally, as discussed above, the check could be located either on the top half of the document or the bottom half of the document. Therefore, there are a total of eight different possibilities for the location of the check portion on the document.

In contrast, a tri-fold check document provides an indication of the orientation of the check because the check will be oriented so that the length of the check is oriented across the short side of the document. In other words, based on the measured length of the sides of the document, it will be apparent whether a document is either upright or inverted (i.e. a portrait orientation), or either rotated 90° or rotated 270° (i.e. landscape orientation). Accordingly, for certain documents it will be possible to determine information regarding the orientation of the check based on the dimensions of the document. For other documents the dimensions may not provide information regarding the orientation of the check.

Although the size and position of a check on a check/skirt document can vary, there are certain standard features. Referring to FIG. 17, one characteristic is the MICR line 255. A MICR line is a sequence of magnetic ink characters, and every check is printed with a MICR line. In addition, the MICR line characters are printed in a standardized font (such as E-13B in the United States) and the line is positioned within a predetermined area of a check. Specifically, the MICR line is positioned a predetermined distance from the right edge of the check. The length of the MICR line is also standardized.

There are several options for processing the documents to reduce the labor required to remove the skirt portions of the checks. The first step in each option is to locate the check portion of the document. In the present instance, the check portion is identified by locating the MICR line, which can be done by implementing either a global search or a local search.

Global Search

A global search entails searching the entire document in an attempt to locate the MICR line. Referring to FIG. 17, the document is scanned to create an image data file. The image data is analyzed line by line. When a mark is encountered the mark is analyzed to determine whether the mark is a MICR character. Specifically, the marking is compared with standard MICR fonts using one of any known pattern matching methodologies. One such method is disclosed in U.S. patent application Ser. No. 10/653,693, filed on Aug. 30, 2003, which is owned by the Opex Corp., which is also the assignee of the present application. If it is determined that the mark is a MICR character, the image data for the area adjacent the MICR character is analyzed to determine whether the identified character is part of a MICR line. The determination of whether a series of markings is a MICR line can include a verification step based on a measured characteristic, such as the length of the MICR line or the distance from the end of the MICR line to the edge of the document.

If the entire data file for a document is analyzed and a MICR line is not identified, the document may be in a rotated orientation. Accordingly, the image data is analyzed as discussed above to determine if the document is in a first rotated orientation, such as a 90° rotated orientation. If the entire data file is analyzed and a MICR line is not identified, the image data is again analyzed to determine if the document is in a second rotated orientation, such as a 180° rotated orientation. If the entire data file is analyzed and a MICR line is not identified, the image data is again analyzed to determine if the document is in a third rotated orientation, such as a 270° rotated orientation. If a MICR line is not identified in any of the four analyses of the image data, the document is identified as not being a check, and is subsequently processed accordingly.

Local Search

The second methodology for identifying the MICR line is a localized methodology. Under the localized methodology, rather than scanning the entire data file, select portion of the image data are analyzed. Since the location of a MICR line on a check is standardized and since the length and height of a MICR line is standardized, the possible locations for a MICR line on a document are limited to particular areas.

Referring to FIG. 18A a payment document is illustrated. The document 250 includes the check portion 251, the detachable skirt portion 252, and the perforated line 254. The document in FIG. 18A is one in which the check portion 251 has the skirt 252 attached to the bottom edge of the check. Therefore, the MICR line is positioned in the window identified as 260 when the document is in an upright orientation. As shown in FIG. 18A, in the upright orientation, the MICR line is positioned adjacent the middle of the document.

FIG. 18A also illustrates the alternative locations of the MICR line when the document is in any of three alternative orientations. For instance, if the document is rotated 90° counter-clockwise, the MICR line would be located within window 261. If the window is rotated 180°, the MICR line would be located within window 262, and if the window is rotated 270° counter-clockwise, the MICR line would be located within window 263.

FIG. 18B illustrates the four alternative locations of the MICR line when the skirt 252A is attached to the top edge of the check 251A as opposed FIG. 18A in which the skirt was attached to the bottom edge. Since the location of the check 251A on the document 250A is different, the location of the MICR line on the document will differ, as illustrated by the windows 260A, 261A, 262A, 263A in which the MICR line is expected to be depending on the orientation of the document. As in FIG. 18A, window 260A corresponds to the expected location of the MICR line if the document is in an upright orientation. If the document is rotated 90° counter-clockwise, the MICR line would be located within window 261A. If the window is rotated 180°, the MICR line would be located within window 262A, and if the window is rotated 270° counter-clockwise, the MICR line would be located within window 263A.

FIG. 18C overlays the windows in FIG. 18A and the windows in FIG. 18B. Rather than scanning the image data for the entire document, the image data corresponding to each window can be analyzed in succession to evaluate the presence of a MICR line. Specifically, the image data for each window is analyzed to identify the presence of a mark. If a mark is detected, the mark is analyzed to determine whether the mark is a MICR character. Depending on the window being analyzed, it may be possible to assume the expected orientation for a MICR character to aid in the pattern matching to analyze whether a marking is a MICR character. For instance, it may be assumed that if a MICR character is located in an area of an image corresponding to window 262, the MICR character would be in an inverted orientation. Therefore, rather than attempt to match a marking to a pattern indicative of an upright MICR character, the marking can be compared with a pattern indicative of an inverted MICR character.

If a mark is determined to be a MICR character, the image data in the rest of the window is analyzed to determine whether the image data includes a MICR line in the window. Since the orientation of a MICR character has already been determined, the subsequent pattern matching for the MICR line may be performed presuming the orientation of the remaining MICR characters is the same as the first MICR character. Accordingly, the remaining markings can be compared with a pattern indicative of a MICR character in the same orientation as the first identified MICR character.

If the image data for a window is analyzed and a MICR line is not identified, the image data for the next window is analyzed. The image data for the windows 260, 260A, 261, 261A, 262, 262A, 263, 263A, are analyzed in this way until either a MICR line is identified or the image data corresponding to all of the windows is analyzed. If a MICR line is not identified in any of the windowed areas, the document is identified as not being a check, and is subsequently processed accordingly.

Once a MICR line is identified, the apparatus can determine which portion of the document 250 is the check. For instance, if a MICR line is identified in window 260, the apparatus determines that the document 250 is in an upright orientation and that the skirt is attached to the lower edge of the skirt. Therefore, a boundary determination can be made for the document to determine which part of the document is the check 251 and which part is the skirt 252. Again, assuming that a MICR line is identified in window 260, the apparatus determines that the boundary for the check is along a horizontal line below the MICR line. Additionally, since the location of the MICR line relative to the bottom edge of the check is standardized, the apparatus may determine that the boundary line for the check is a line parallel to the direction of the MICR line and spaced below the MICR line a pre-determined distance.

In contrast, if the location of the MICR line indicates that the check is attached to the bottom edge of the skirt, the boundary line between the check and the skirt would not necessarily correspond to a pre-determined distance from the MICR line, because the height of a check is not currently standardized. Therefore, some other criteria is used to determine the boundary line between the check and the skirt. For instance, many skirts are configured to be the same size as the check. Therefore, in some instances, the location of the boundary may be determined based on the height of the document and the location of the MICR line. Specifically, the height of the document may be determined based on the image data. If the MICR line in identified in window 260A, the boundary line may be determined to be a line parallel to the MICR line located halfway up the height of the document.

In addition to the location of the MICR line, other characteristics can be used to identify or validate the location of the boundary for the check. For instance, other features such as the location of the courtesy box 256, the date entry line 257 or the address 258 printed on the check, are all characteristics that may be used to identify which portion of the document is the check.

The various features may be used in combination to increase the likelihood that the check identification decision is correct. For instance, the MICR line 255 may be identified as described above. In addition, based on the location and orientation of the MICR line, the image data may be analyzed to attempt to identify the address 258 on the check, which is customarily located in the top left portion of a check. As with the MICR line, the address determination can be done using either a global search of the document or using a local search to search select portions of the document in which the address would be expected to be based on the orientation of the check.

The address determination can be used to validate the MICR determination. For instance, if the address is identified in an orientation and location corresponding to the orientation and location determined for the MICR line, then the location and orientation of the MICR line is validated so that it can be used to determine the check boundary. If the location of the address does not correspond to the location or orientation of the MICR line, then the document may be flagged for alternative processing.

Additionally, the address determination can be used in combination with the MICR line to determine the boundary for the check. For instance, the MICR line can be used to identify the bottom edge of the check, and the address can be used to identify the top edge of the check.

In addition to the global and local searches discussed above, the apparatus may also provide for manually identifying the portion of the document that corresponds to the check. For instance, the image of the document may be displayed on a monitor, and the operator may be provided with a mechanism for dragging a window around the portion of the window corresponding to the check. One such mechanism may be a touch screen monitor. Using a touch screen, the operator touches the screen to locate and size a selection window around the check portion of the document. Alternative input devices could also be used, such as a mouse, a touchpad or a stylus.

Additionally, the input device can be used to override and/or confirm the boundary determination made by the apparatus in some situations. For example, the system may make a best guess determination of the check boundary based on various characteristics. If the level of certainty of the determined boundary is below a certain level, the system may make a best guess of the boundary and then display the scanned image of the document 250 with a window showing the best guess for the boundary. If the best guess is accurate, the operator can confirm the selection. If it is inaccurate, the operator may resize and/or re-locate the window to select the correct boundary for the check.

Assigning Reference Information

Another aspect of the apparatus 10 is the ability to assign one or more pieces of information to one or more documents. The information may be assigned to a single document, or it may be assigned to each document in a transaction, batch or entire job comprising multiple batches. An interface between the operator and the apparatus is provided so that the operator can input the reference information when necessary or desired.

One way that reference information can be assigned is according to batch. Typically, batch information is assigned prior to processing a batch of documents. Each document in the batch is then tagged with the batch information. The batch reference information can be physically or electronically tagged to the document. The reference information can be physically tagged to a piece by printing the reference information onto the piece by the printer modules 190. The reference information can be electronically tagged by associating the reference information with each piece in a data file. For instance, a data file may be produced for each document, which would include information such as the sequence number of the piece, the image data for the piece, the date and time the piece was processed and information that is electronically determined, such as a MICR line or an OCR line on the piece. The batch reference information may be included in the data file for the pieces in the batch.

Another way that the reference information can be electronically tagged to a piece is by electronically printing the information on the image data. More specifically, the reference information is added to the image data for a piece, so that when the image data is viewed, the reference information also appears. The reference information may overlay the image data so that when the image is viewed, the information appears to be printed on the document. However, more typically, the reference information is positioned adjacent the image when the image is viewed.

When tagging a batch of documents with batch reference information, it is desirable to identify the reference information before processing the documents. By identifying the batch reference information prior to processing, the reference information can readily be tagged to the documents while the documents are processed through the apparatus. In some instances it may be acceptable to identify the batch information after the batch has been processed.

The batch reference information may be identified in any number of ways. The operator may key reference information into the apparatus through a keyboard or similar input device. Alternatively, the operator may scan in the reference information. For example, the apparatus may include a hand-held bar code scanner that can be used to scan in reference information. It is also possible to identify the reference information by the processing of a document. For instance, before the first piece of mail is processed, a document with reference information printed on it may be processed through the machine. The reference information may be a bar code, a MICR line, characters that can be read using OCR or otherwise. The apparatus may scan the first piece, read the reference information and then identify the documents with the reference information.

In addition to tagging documents with reference information according to batch, documents can be tagged by transaction. A transaction includes one or more documents that were extracted from a single envelope. In certain instances it may be desirable to tag each document in a transaction with reference information that relates to the transaction. As with batch reference information, transactional reference information may be identified manually by operator input or automatically by scanning information off of a document. After the transaction ends, the apparatus stops tagging documents with the transactional reference information.

In addition to batch and transaction basis, reference information may be tagged to a document on a document by document basis. As with the discussion of batch and transactional reference information, document reference information may be identified manually by the operator or automatically. With document reference information the information is only tagged to the specific document. However, the document may have several pages. If the document has several pages, each page is tagged with the document reference information.

As discussed above, reference information can be tagged to documents on a job, batch, transaction, or document basis. In addition, the references can be combined so that a particular document in a job could be tagged with job reference information, batch reference information, transactional reference information and document reference information. Further still, there can be several of each type of reference information. For example, a document may have a single piece of batch reference information, two pieces of transactional reference information, and two pieces of document reference information.

The following examples illustrate applications of the reference information described above. A first example illustrates a use of batch reference information. One of the benefits of scanning images of each document is that the document images can be used during subsequent processing rather than the original documents. Therefore, it may be possible to discard the documents and simply use the images. However, it may be necessary to examine an original document in the event that there is a dispute regarding the document or if the document image does not show all of the details of the document. Accordingly, it is desirable to provide an efficient method for scanning and archiving documents to facilitate retrieval of the documents at a later time if necessary.

The first step is to provide a container, such as an archive box, onto which a unique identifier is attached, such as a barcode. Before processing a batch of documents, the operator is prompted on the view screen to enter the identification number for the archive box. The operator scans the bar code using a hand held bar code scanner connected to the apparatus 10. The bar code scanner reads the bar code and decodes the bar code to determine the identification number for the archive box. Alternatively, the identification number may be printed on the archive box and the operator may manually key the box number into the system. The identification number for the box is the reference information for the batch of documents that are to be stored in the box. Other reference information may also be identified for the batch. For instance, the operator may key in information about the batch of documents such as what type of documents are in the batch or where they were received from.

After the operator identifies the batch reference information, the operator processes the documents. For instance, if the batch of documents were previously extracted from envelopes, the documents are processed by dropping the documents onto the conveyor, either alone or in packets. The documents are then scanned and discharged into the output bins, as described previously. During the processing, the documents are electronically tagged with the batch data. After processing, the documents are removed from the bin or bins and placed into the archive box. Preferably, a sequence number and the box identifier are printed onto the documents as the documents are processed, and the documents are placed into the archive box in order according to sequence.

Further processing for the processed documents can be accomplished using the image data for the documents. However, if the physical document needs to be retrieved, the data file for the document will identify the identification number for the box in which the document is located, and the sequence number for where the document is located within the box. And again, when the document needs to be replaced, the correct box and location can be readily determined since the document is tagged physically and/or electronically with the box identification number and sequence number.

In this way, a processed document can be easily stored and efficiently retrieved if necessary. At the same time, the box identification number can be identified directly off of the box and the documents can go directly into the box after processing, so that there is less likelihood of placing the wrong documents into the wrong box. Further still, if the box identification number is scanned in off of the box, such as by a bar code scanner, there is less likelihood that the wrong identification number will be entered for a box, which could lead to mis-storing a batch of documents.

A second example of using reference information is an application that uses transactional reference information. Frequently, important papers are sent via a courier that tracks the location of the package from the person shipping the package to the person receiving the package. Most typically, the packages are tracked using some type of bar code. When the documents are received and then scanned it is desirable to retain the tracking information for the package, but it is normally desirable to discard the packaging materials.

Accordingly, a batch of documents in courier envelopes, such as Federal Express, Express Mail or UPS envelopes, can be processed as follows. Before processing the documents in the envelope, the operator identifies the transactional reference information for the documents in the envelope. The operator can key in the information, but preferably, the operator simply scans the tracking bar code on the package with a bar code scanner. After the transactional reference information is identified, each document in the transaction is processed and each document in the transaction is tagged with the transactional reference information.

To ensure that the transactional reference information is associated with the proper document, it is important to ensure that the beginning and ending of a transaction are properly defined. The beginning of a transaction can be identified manually by the operator pushing a start transaction button. Alternatively, the beginning may simply be defined by the act of scanning the tracking bar code. The system then treats every subsequently processed document as a part of the transaction until the end of the transaction is somehow indicated.

The end of the transaction can be identified in one of several ways. For instance, the operator can manually depress an end of transaction button after the last document in a transaction is processed. Alternatively, the act of scanning in the next tracking bar code can indicate the end of a transaction. Specifically, after scanning a tracking bar code, the system may assume that all of the subsequent documents are part of the same transaction until the operator scans the next tracking bar code. In other words, the act of scanning the tracking bar code can operate as both an indicator of the end of the previous transaction and the beginning of the next transaction.

As described above, the transactional reference information is typically identified prior to scanning the transactional documents. The method described above included scanning the bar code for the reference information, processing a transaction, and then scanning the next bar code for the next transaction. It may be desirable to identify the reference information for the next transaction while the previous transaction is still being fed into the imaging station. Accordingly, the operator may indicate whether the reference information is for the current transaction or the next transaction. This option may be displayed on the view screen as a choice to be made before inputting the reference information.

In this way, while the machine is processing a transaction, the operator may select the next transaction option and then enter transaction reference information for the next transaction. The system will continue to process the current transaction with the previous transaction reference information. After the current transaction is done being processed, the operator drops the next transaction onto the drop conveyor and the system tags the documents with the new transactional reference information that the operator had already entered.

Another application of reference information relates to the use of document reference information. In certain instances, the operator may manually identify some information regarding a document that may not be readily identifiable from the scanned image. Prior to scanning the image, the operator identifies the information regarding the document, such as by keying the information into the system via a keyboard or other interface, such as a voice recognition system. For instance, if a document has unusual printing that does not optically scan well, someone looking at a subsequently scanned image of the document may not know that the image may not accurately reflect the document. One example is a document that has red ink, which does not scan as well as other colors. When viewing the image, it may appear that certain information is missing or incorrect. If the person viewing the image knows that the document had red ink printing, the person may more readily recognize the need to retrieve the original document, or alter the processing of the document. For example, during subsequent processing, an operator may be viewing the document image to credit a customer's account with a payment. The amount of the check may not be clear on the check because the printing on the check could not be properly scanned (e.g. too light, printed in red ink etc.). The operator may have recognized the problem with the printing, manually read the check amount, and keyed in the information as document reference information. During subsequent processing, the operator crediting the payment may use the document reference information to identify the amount to credit the customer's account.

Yet another example of using document reference information is a memo on the memo line of a check or document. A customer may write information in the memo line, such as an account number, or some indication that the document requires special processing. When scanning the documents, the operator may manually read the information and input the information as document reference information. The document reference information can be used to sort the document. For instance, the document reference information could specify that there is a change of address. All such documents may be sorted separately. Subsequently, an operator may process all of the change of address documents at once by viewing the image, and keying in the change of address information into the customer's account.

Each of the foregoing applications only discussed using a single type of reference information. However, as previously described, the different types of reference data can be used in combination. For instance, in the first example, all of the documents in a batch were tagged with batch reference information regarding the identification number for the box into which the documents were to be archived. If the batch includes a package with a tracking bar code, the operator may scan the bar code so that all of the documents in the package are identified with the tracking information. At the end of the transaction, the operator presses an end of transaction button to indicate the end of the transaction. Further, when processing one of the documents in the transaction, the operator may notice an anomaly with one of the documents. Before processing the document, the operator may input reference information for the specific document, and only that particular document is tagged with the document information.

In yet another example, a job may include two batches of documents that are mixed together, and the documents are to be tagged with the proper batch reference information. In such an instance, the proper batch reference information should be identified for each document, which can be done in one of several ways. For instance, the operator can identify batch reference information for a first batch, and the documents are tagged with the first batch reference information until some document is identified as being in the second batch. This can be done by the operator indicating that the document is one being in the second batch. The document is then tagged as being in the second batch. Similarly, subsequent documents may be tagged as being in the second batch until a document is identified as being in the first batch. Alternatively, the batch into which a document belongs may be identified on a document by document or transaction by transaction basis.

One application of the mixed batch would be in which a job of documents needs to be sorted according to certain criteria and stored into two different boxes. Before processing a job of documents, the operator may scan the first box and identify the box identification number as batch reference information for a first batch of documents. The operator may then scan the second box to identify the box identification number as batch reference information for a second batch of documents. As the documents are processed the documents are identified as being in the first batch or the second batch. The batch reference information is tagged on the respective documents and the documents are sorted into the output bins according to batch. The documents from the first batch are then removed from the appropriate bins and placed into the first box and the documents from the second batch are removed from the appropriate bins and placed in the second batch.

As mentioned previously, the batch to which a documents is identified may be determined in a number of different ways. One example in the mixed batch application is to sort documents into batches based on information on the documents. Specifically, a MICR line includes information regarding the institution from which a check is drafted. The MICR line may be read for the checks as the documents are processed, and the documents can then be sorted into batches according to the institutions. For instance, if a transaction has a check that is drafted from a first institution (or group of institutions), then all of the documents in the transaction are identified with first batch reference information. If a transaction has a check that is drafted from a second institution (or group of institutions), then all of the documents are identified with second batch reference information.

As can be seen from the foregoing examples, the apparatus 10 provides flexibility in processing documents in an automated manner, while allowing for efficient intervention from the operator to supplement or modify the information stored for the documents. Several applications have been described, however the flexibility of the features of the apparatus allow it to be used in a wide variety of applications using different combinations of different features. Accordingly, although particular embodiments and applications of the apparatus have been illustrated and described, it is not intended to be limited to the specific embodiments and applications.

Processing Lockbox Documents

Rather than handle payments that come in the mail, many companies pay lockbox processing companies to process the mail and deposit the payments. Lockbox mail is included in first-class mail, but it represents a small portion of the mail. However, due to the nature of wholesale lockbox mail, the lockbox mail receives special expedited treatment. Specifically, wholesale lockbox mail typically includes relatively high dollar volume payments. Therefore, the lockbox customers desire to have the payments deposited into their accounts as rapidly as possible.

Although customers desire rapid deposit of the checks, the nature of wholesale payments makes it difficult to process the payments using high-speed automated systems. Wholesale lockbox payments are typically not “clean mail” (i.e. mail pieces that includes a single check and a corresponding standardized payment stub). Wholesale lockbox payments are typically business payments and different businesses make their payments in different ways. Some businesses pay multiple invoices with a single check; some businesses include copies of the invoices, some do not; and some businesses simply print the invoice numbers and amounts on a skirt attached to the check. In addition, often the size of the documents is larger than the size of documents that can typically be handled by high-speed fully automated systems. This variety in the format of the transactions has typically required significant human intervention to process the payments.

Further, lockbox processors typically process mail for hundreds or thousands of customers. Often a zip code will be designated for a particular lockbox processor, so that the post office simply groups together all of the mail to the zip code and either delivers it to the lockbox processor or notifies the lockbox processor that the mail is ready for pick-up. The lockbox processor may use P.O. box numbers to identify each customer (i.e. each customer is assigned a separate P.O. box number). Therefore the processor can separate the mail for each customer by sorting the mail according to P.O. Box. Since each customer is different, some customers may receive large quantities of mail on a regular basis, while some customers may receive small quantities of mail. One of the challenges in processing the lockbox mail is to efficiently process the payments rapidly in light of the variety of types of mail, the variety in the volume of mail for each customer and the variety in how each customer wants the documents handled.

Overview of Lockbox Method

As discussed further below, the mail processing apparatus 10 can be utilized to process lockbox payment in a variety of ways. First, the documents are extracted from the envelopes and then scanned to acquire image data for each document. As the documents are scanned, the apparatus 10 separates the document images to group the documents together according to transaction. In other words, the apparatus groups together the images for documents removed from a particular envelope and separates the images from images for documents extracted from other envelopes. The document images are analyzed to extract various data, such as the payor, the payee and the amount of the payment. The payments are credited to the payees' accounts and a report may be provided for each payee that summarizes information regarding the payments received. In addition, the document images and the extracted information may be combined to provide electronic records that include the information necessary to submit the checks to a financial institution to clear the checks.

Rather than separating the batches of mail by P.O. Box (i.e. by customer) and then separately processing the mail for each customer, the method can be utilized to process mixed batches of mail that include numerous customers, thereby eliminating one or more sorting steps. Further, by scanning each document and using the image data to process the documents, the need to handle the documents during numerous processing steps is eliminated. In addition, each payment can be processed separately from extraction of the documents to the deposit of the check, without delaying the deposit of the check to group the check with other payments for the customer. The information for the various payments can be combined together into a report to the customer after the deposits are made. These and other advantages are discussed in greater detail in the following sections.

Extracting and Scanning

The apparatus 10 is operable to process batches of mail in a variety of ways. According to one methodology, the mail is sorted so that the mail for each customer is grouped together. The mail is then processed customer by customer so that all of the payments for a particular customer are processed together and deposited together.

Alternatively, the mail may be processed without sorting the mail by customer. Instead, the mail is processed as a mixed batch with a plurality of payments from numerous payors to numerous payees. For instance a batch of 20,000 mail pieces may include payments from 20,000 payors to 500 payees. In some instances lockbox processors may utilize work groups that process the mail for certain groups of customers. For instance, a lockbox processor may use 10 work groups to process the mail for 500 customers. Each work group may be assigned approximately 50 customers. Therefore, when the mail is received, the mail is sorted into groups of customers. In other words, the mail for the customers that work group 1 processes is separated from the mail that work group 2 processes. These smaller batches of mail may then be processed by the work groups as mixed batches of mail.

To process the documents, the mail is opened and the documents are extracted from the envelopes. The extraction may be performed on the apparatus 10 by the operator as described previously. However, in some instance, it may be desirable to extract the documents using a different apparatus and then utilize the present apparatus 10 to scan and sort the documents. Although either method can be used, the following discussion describes the methodology using the apparatus 10 to extract the documents.

A mixed batch of mail is placed into the input bin and the feeder 30 serially feeds the mail. From the input bin, the pieces are conveyed past one or more cutters that cut open one or more edges of the envelopes. The envelopes are then conveyed to the extraction station 70 where the cut envelops are pulled open to present the contents to the operator. The operator manually removes the contents and drops the contents onto the roller bed either one at the time or as a packet.

Before dropping the documents onto the conveyor 100, the operator may visually inspect the documents for certain characteristics. For instance, the operator may examine the check to ensure that the check is made out to a customer and that the customer is properly identified. In other words, if the lockbox processor does not handle the mail for whom the check is made out, the operator may place the check and the accompanying document into an outsort bin to be handled separately. If the operator identifies some other problem with a check or an accompanying document, the operator may outsort the document(s) rather than scanning the document(s).

As discussed previously, the mail received by a wholesale lockbox processor may be in any of a variety of formats. Typically the mail pieces are payments so that each transaction includes a check. The check may be the only document in the transaction, or there may be other documents in the transaction, such as a copy of the invoice(s) being paid. If a documents is folded, the operator unfolds the document and drops the document onto the conveyor 100.

The documents are fed into a scanner and image data is acquired for each document as described previously. In addition, it may be desirable to scan the documents to attempt to read any MICR characters on a document. Specifically, checks are encoded with a MICR line that includes information regarding the check, such as the bank and the account which the check is drawn against. Accordingly, as described previously, a MICR reader may scan each document to attempt to read the MICR line on the documents. To facilitate reading the MICR line, the operator may drop the check onto the conveyor in a particular orientation to ensure that the MICR line is aligned with the MICR reader. For instance, the MICR reader may be oriented to read the MICR line when the checks are disposed in an face-up, upright orientation. Therefore, the operator would drop the check onto the conveyor so that the front face of the check is facing up and the bottom edge of the check is toward the guide rail 105 of the conveyor. However, preferably, the MICR reader is configured so that the MICR reader can read the MICR line when the check is in a variety of orientations. For instance, preferably the MICR reader can read the MICR line when the check is face down and when the top edge of the check is toward the guide rail. However, preferably, the operator drops the check onto the conveyor so that either the right or left side of the check is the leading edge as the check enters the MICR reader, rather than the top or bottom edge of the check being the leading edge. In other words, preferably the check is fed into the MICR reader in a landscape orientation so that the MICR characters in the MICR line are serially conveyed past the MICR reader.

After the documents are scanned, the documents are sorted into a plurality of output bins. As discussed previously, the documents can be sorted in a variety of ways depending on the needs of the lockbox processor and the desires of the customers. For instance, since the documents have been scanned, the images of the documents may be forwarded to the customers, rather than forwarding the actual documents. However, some customers may desire to receive the actual documents. Documents for customers who desire to receive the actual documents may be sorted to a separate bin so that the documents can be forwarded to the customer.

As described previously, the images of all of the documents from a single envelope are associated together as a single transaction. This is referred to as maintaining transactional boundaries, and as described previously, several methodologies can be utilized to maintain the transactional boundaries, such as determining the transactional boundaries based on the transition from a first type of document to a second type of document or by the operator manually identifying the transactional boundaries, such as by pushing a button. Although the images are maintained so that the images for the documents are associated together, the actual documents for a transaction may be separated from one another. For instance, the documents may be processed so that all of the checks are sorted into one or more output bins, while all of the accompanying documents are sorted into one or more separate bins.

Analyzing Images/Information Acquisition

Since the documents are scanned to acquire images of the documents, the document images can be used during subsequent processing rather than referring to the actual documents. In order to process the payments, several pieces of information are determined. For example, the payor, the payee and the amount of the payment are determined.

Determining Payment Information

Much of the information for a payment may be extracted from the image data for the check. Specifically, each payment includes a check, which identifies the payee, the amount of the payment, the payor, the payor's bank, and the payor's bank account information. The relevant payment data can be determined for each payment either automatically or manually by an operator. Specifically, as discussed further below, the image data for the documents in a payment can be automatically analyzed to attempt to determine and/or validate each piece of data needed for a payment record. Alternatively, the image data for each document can be displayed for an operator who can view the image, determine the payment information and input the payment data. Although it may be possible to determine the payment information using either an automated methodology or manually using operator intervention, preferably the payment information is determined using a combination of automated and manual techniques, as described further below. For instance, the payment information may be determined using various automated techniques, and the payment data may be validated and corrected as necessary by an operator before the payment record is finalized.

Automated Data Determination

The following discussion describes several techniques that may be employed to determine various elements of payment data without human intervention. First, as described above, a MICR reader may be employed to scan each check to read MICR information on the documents. The most common type of MICR information is the MICR line on checks. If the MICR reader identifies and reads the MICR line, the information from the MICR line may be added to the data record for each payment. The data from the MICR line may include the identification of the payee's bank and bank account number.

In addition to the data from the MICR reader, an image processor, in the form of a microprocessor may be used to analyze the image data for each document in a transaction to identify various pieces of payment data. For instance, the image data for a check may be analyzed using optical character recognition (OCR) to identify data. Although the documents in wholesale lockbox mail vary, the general format for checks has certain standards. For example, on a check, the payor is typically identified on the upper lefthand corner of the check, the MICR line is located on the bottom edge of the check, the payee is located in the middle of the document, next to the courtesy box in which the amount of the check is identified in numerical format, and the amount of the check is again printed, but in word form, below the payee. Further, the check is normally dated on the upper righthand side of the check.

Although much or most of the information for a payment may be identified by analyzing the image data for a check, some data for the payment may be identified on documents accompanying the check. For instance, a payment may include a copy of the invoice or invoices that are being paid. Although the form and format for the documents received for all of the lockbox processor's customers may vary greatly, the documents for each particular customer may include certain consistencies. Therefore, the documents that accompany a payment for a particular customer may be analyzed using expected characteristics for the documents for the particular customer's documents.

For instance, each customer likely uses a similar format for all of its invoices. Therefore, a database may store information for the layout of a particular customer's invoices. When a payment is identified for the customer, the documents in the transaction are analyzed to determine whether the document's characteristics match the characteristic features stored in the database for the customer's invoices. If the characteristic features match, then the document is analyzed to determine various payment data, such as the invoice number and the invoice amount.

For example, customer A may use an invoice that is 8½×11 inches, and includes the customer's logo in the upper lefthand corner. Further, the invoice includes an OCR line at the bottom of the invoice, which identifies the customer account number and the amount of the invoice.

A transaction including two documents is analyzed. The first document is determined to be a check, and an analysis of the image data for the check indicates that the check is payable to customer A. The information in the database regarding customer A's invoices is retrieved from the database, and the second document is analyzed to determine whether the document has the characteristics expected for customer A's invoices. Specifically, if the document is 8½×11 inches (or within an acceptable range) and an analysis of the image data indicates a pattern matching customer A's logo, then the document is determined to be customer A's invoice and the document is analyzed accordingly. The image data for the document is then analyzed to identify and read the OCR line at the bottom of the document to determine the invoice number and amount.

In addition to identifying the document as customer A's invoice, the identified characteristics can be used to determine the orientation of the document. Referring to the above example, if customer A's logo is expected in the upper righthand corner and the logo is identified in the lower lefthand, then the document is determined to be inverted (i.e upside down), and the image data is analyzed accordingly to identify the OCR line.

As discussed above, a database may include stored information regarding various characteristic of documents for particular customers. This information can be retrieved to analyze documents or image data for the documents to determine information regarding the particular payment. Although a particular example has been discussed above, the methodology for processing lockbox documents is not limited to a particular process for analyzing documents. Rather, a variety of techniques may be utilized for automatically identifying documents and determining document characteristics.

Manual Data Determination

In addition to the automated data determination techniques discussed above, it may be desirable to use human operators to determine some or all of the payment information for a transaction. The simplest manner for utilizing a human operator is to display the image data on the display 20 so that the operator of the apparatus 10 can view the documents and key in the relevant payment information. For example, the documents for each transaction may be displayed on the display 20, and the operator reads the image of the check to identify the name of the company making the payment, the company to whom the payment is made, the amount of the payment etc. The operator then inputs the payment information using an input device, such as a keyboard.

Although the image data can be displayed locally by the apparatus 10, it is typically more efficient to export the image data so that the image data can be viewed at a remote location by a different operator. For instance, the image data may be exported to remote workstations via any of a variety of connections, such as over a local area network or via a secure connection over the internet. At the remote workstations, the operators view the image data and key in the relevant information for each payment. The payment information for each transaction is then stored in a database along with the image data for the documents in the transaction.

Validating Payment Information

In addition to identifying the information for a payment, it is desirable to validate some or all of the data regarding a payment. As with the process for identifying the data, the payment data may be validated manually, automatically or by using a combination of manual and automated processes.

One method for automatically validating payment information is by verifying the payment data from one document in a transaction with information from elsewhere in the transaction. For instance, using OCR it may be possible to identify the amount of a payment by analyzing the image data for the courtesy box on a check. The courtesy box is the area on a check where the check amount is written in numerical form (e.g. $123.45). The payment amount can be validated by using OCR to analyze the image data for the payment line, where the check amount is written in word form (e.g. One Hundred Twenty Three and 45/100). If the OCR analysis of the courtesy box and the OCR analysis of the payment line result in the same number, then the payment amount may be considered to be validated.

In other instances, payment data may be validated using information identified from other documents in a transaction. For example, a transaction may include a check and a copy of an invoice. As described previously, using various templating or other document analysis techniques, it may be possible to use OCR to analyze the image data for the invoice to identify the invoice amount. In addition, as described above, using OCR, the payment amount for a check may be determined automatically. If the invoice amount determined by OCR matches the check amount determined by OCR, then the payment amount may be considered to be validated.

Further, in some instances it may be desirable to scan the envelope for a transaction to acquire image data for the envelope. In such an application, the envelope may be processed as if it is a document in the respective transaction. In certain applications, the information from the envelope can be used to verify information regarding the payment. For instance, as discussed previously, typically, a lockbox processor assigns a P.O. box number to each customer. Therefore, the customer for a particular lockbox should be the payee on checks directed to that particular lockbox. The image data for an envelope may be analyzed using OCR to identify the P.O. box number in the address. If the P.O. Box number on an envelope does not match the payee on the check in the transaction, then the documents may be electronically tagged and sorted separately from the validated document. Additionally, or alternatively, the image data for the documents in the transaction may be identified and stored separately from the images for the validated documents.

In addition to automatically validating the payment information, some of the data may be validated manually. For instance, the image data may be processed by an image processor as described above to determine various data regarding a payment. After the image data is processed to automatically determine the payment data, the image data may be displayed to an operator who can view the image data along with the payment data determined by the image processor. The operator can then either confirm the correctness of the payment data determined by the image processor or correct the payment data by inputting the correct payment information. Similarly, if any of the payment data cannot be identified automatically, the operator can input the information. If the operator adds payment data or changes the payment data, the operator may be prompted to confirm the payment information after the information is input.

As mentioned, one of the pieces of information to be determined and preferably validated is the payee. In particular, it is desirable to identify mis-directed and mis-identified payments so that the payments are not processed and potentially deposited inappropriately. Mis-directed payments are payments that are payable to a company that is not one of the lockbox processor's customers. For instance, if the mail includes a check payable to ABC Inc., and the lockbox processor does not process mail for a company named ABC, then the payment is mis-directed. Mis-identified payments are payments that are directed to one of the lockbox processor's customers, but the customer name is mis-identified on the check. For instance, the customer name may be Smith Machinery Inc. and the check may be made out to Smith Inc. or Smith Machinery. The identification Smith Machinery would likely be an acceptable identification for depositing the check into the bank account of Smith Machinery Inc, however Smith Inc. may not be an acceptable identification.

Generally, each customer has a listing of acceptable indications for the customer's name. For the Smith Machinery Inc. example, the list of acceptable identifications may include Smith Machinery Inc., Smith Machinery, Smith Machine, Smyth Machinery Inc, Smithe Machinery Inc, and other variations. If any of these variations are used on a payment, the payee identification is acceptable. If not, the payment is considered to be mis-identified.

Mis-directed or mis-identified mail may be identified in a variety of ways. For instance, the operator processing the mail may manually identify mis-identified or mis-directed mail. After extracting the mail, and before dropping the mail onto the conveyor 100, the operator examines the check in a transaction to identify the person or company to whom the check is made payable. If the payee on the check is not a valid payee, the operator places the check and any accompanying documents into an outsort bin so that the documents are not scanned. Instead, the mis-directed or mis-identified payments are processed separately, as discussed further below. Using the operator to identify mis-directed and/or mis-identified mail may be an efficient use if the mail is pre-sorted into batches for an individual customer or a group of a limited number of customers. However, for mixed mail that potentially includes mail for a large number of customers it is less likely to be efficient to use the operator to identify mis-directed or mis-identified payments before the documents are scanned.

Rather than using the operator, the payee may be identified and validated automatically. As discussed previously, using OCR, an image processor may analyze the image data for a check to determine the person or company to whom the check is made payable. In addition, a data file may be maintained that identifies all of the acceptable identifications for all of the lockbox processor's customers. After the payee is determined using OCR, the identified payee is compared to the list of acceptable customer identifications. If the identified payee does not match one of the acceptable identifications, the documents may be electronically tagged and outsorted. Alternatively, or additionally, the document images may be identified and/or stored separately from the properly identified document images.

Processing Mis-Directed or Mis-Identified Documents

It is desirable to separate mis-directed and mis-identified mail from the rest of the mail to ensure that checks are not improperly deposited. A problem may arise in depositing or clearing a check if the check mis-identifies the customer or if the check is mis-directed. In other words, processing mis-directed or mis-identified checks can lead to significant accounting problems downstream in the process.

Mis-directed or mis-identified payments may be processed in one of several ways. Documents that are identified by the operator and separated prior to being scanned can be processed fairly easily. Mis-directed mail can simply be returned to the sender. For mis-identified mail, the customer may be contacted to determine how the documents should be processed, or the documents can simply be forwarded to the customer with a notice indicating the problem.

There are also several ways to process mis-directed or mis-identified payments if the operator does not outsort the documents prior to scanning the documents. The most basic method is to retrieve the documents and re-assemble the transaction and process the documents similar to instances in which the documents are outsorted prior to being scanned, as described above. In such an instance, the image data for the documents may also be deleted. More specifically, after the documents are scanned to acquire image data, the image processor may analyze the data to identify the payee. The identified payee is then compared to a list of acceptable customer identifiers that is maintained electronically, such as in a database. If the comparison does not provide a match before the documents are to be sorted in the bins, the documents may be electronically tagged as mis-directed or mis-identified and sorted to a reject bin. Alternatively, the documents may be sorted into the bins and if the comparison subsequently determines that the payee does not match an acceptable customer identifier, the images may be electronically tagged as mis-directed or mis-identified and the location of the documents in the bin may be identified so that the operator can readily retrieve the documents from the bins for further processing.

An alternative method for processing mis-directed or mis-identified documents uses the document image data so that copies of the documents can be printed to make a copy of the transaction. For mis-directed mail, the copies can be sent to the person who sent the documents along with either a request for how to process the documents or with an indication that the original documents have been destroyed. For mis-identified documents, the transactional documents can be forwarded to the customer with a request for how to proceed. Alternatively, rather than forwarding paper copies of the mis-identified documents, the document images can be forwarded to the customer, with a request for instructions for how to proceed. After the documents or images are forwarded to the customer, the documents and/or images may be maintained in a pending status without further processing while awaiting the customer's instructions. The documents images can be returned to the customer in any of several ways. For instance, the images can be stored on a non-volatile storage medium such as a magnetic tape, a CD or a DVD and the storage medium can be forwarded to the customer. Alternatively, the images may be sent via email to the customer or stored on a secure web server that the customer can access.

Processing Validated Payments

After the information for a payment has been validated, two steps remain, clearance and reporting. In addition, typically, after the payment amount for a payment instrument is determined, the payment amount is printed onto the payment instrument. For instance, the check amount is printed onto a check in a MICR font so that the MICR line at the bottom of the check is extended to include the check amount.

The clearance process involves forwarding the payment instruments to a financial institution for the clearing process. In the past, the payment instruments (e.g. checks, money orders and bank checks) were separated according to financial institution and also possibly by customer. In other words, first, the checks are sorted out from the other documents in the transactions. Then the checks were sorted so that all of the checks drawn against Bank A were grouped together and all of the checks drawn against Bank B were grouped together. Additionally, if the mail was sorted according to customer, the mail would be sorted so that all of the checks for Customer X drawn against Bank A were grouped together. After the payment instruments were grouped together in the relevant groupings, the payment instruments were forwarded to a financial institution, such as a clearing house, to complete the clearance process. Since the apparatus 10 is operable to sort the checks from the other payment documents, the checks can be sorted out and then further sorted for clearing in the manner described above.

Rather than clearing the checks by submitting the papers documents for clearance, the images of the checks can be used to submit the checks (and other payment instruments) for clearance. Specifically, the image data is combined with the payment information regarding the payment to form a payment record that can be submitted electronically for clearance. For instance, the images for a check (preferably the images for the front and back of the check) and the MICR line information may be combined to create a payment record for submission to a financial institution for clearing the check.

The payment records may be grouped together according to the bank that the checks are drawn against, and then submitted for clearance in batches. Alternatively, the payment records can be grouped together and submitted at certain times, such as hourly or four times a day, or otherwise. However, rather than holding payments to group them together, it may be desirable to submit each payment record individually after the payment record is completed and validated. In this way, the payments are processed serially so that the documents are scanned, analyzed, validated and submitted for clearance individually, without being batched with other payments for clearance. In other words, in a batch of 20,000 mail pieces that contain 20,000 checks, there will be upwards of 20,000 different steps of submitting payment records for clearance (assuming that each check is properly executed and properly identifies a customer of the lockbox as a payee). Therefore, for a batch of mail, the number of separate submissions of payment records to a financial institution will be substantially similar to the number checks in the batch of mail. Eliminating the step of grouping the payments together for clearance, reduces the time frame for depositing payments and having the payments credited to the payees' bank accounts.

In this way, although not the only mode of operation, one mode of processing documents using the apparatus 10 includes scanning the documents extracted from an envelope to acquire image data for each document in the envelope. The scanned documents and/or the image data are analyzed to identify a payment instrument, such as a check from among the documents. Information regarding the payment instrument is extracted, such as the payment amount and the payee. A payment record that includes the image data for the payment instrument is created for each payment instrument. The payment record containing only the payment information for the single payment instrument is then submitted separately for clearance to a financial institution that is not the same as the lockbox processor. Although in certain instances some of the payment records may be batched together for clearing together, preferably at least a majority of the payment records are submitted individually. In addition, some of the checks in a batch of mail may be mis-directed or mis-identified or cannot be processed for some other reason. Although a batch may include some payment records that are batched for submission for clearance, and some of the checks in a batch of mail may not be submitted for clearance, preferably the number of submissions of payment records for clearance for a batch of mail is substantially similar to the number of checks in the mail.

In addition to submitting the payment instruments for clearance, the apparatus is operable to track the deposits and provide records to the customers to inform the customers of the payments that were received and the deposits that were made. As discussed previously, the apparatus is operable to process a batch or mail containing mail for a variety of customers without pre-sorting the mail according to customer. However, as the apparatus processes the documents, the apparatus tracks the payment information for each transaction so that the customer for each payment transaction is identified and stored. The apparatus can then sort the information regarding the payment transactions by customer. In this way, a summary of the payments received for each customer can be created and provided in a report form. In addition, if a customer so desires, a log with all of the information regarding each payment can be created so that a customer can see all of the details for all of the payments received by the lockbox for the customer during a certain time period (e.g. daily, weekly or monthly). The report may include the date and time that a payment was scanned and analyzed, the amount of the payment, the payor, and the date and time that the payment was deposited. The report also may include information extracted from the documents accompanying the payment, such as invoice number or other payment identifying information. The report may be combined together with the image data for the documents in the payments referenced in the report, and stored on a non-volatile storage medium.

In addition, it may be desirable to consolidate the information and the image data for all of the payments received for a particular customer during a particular time at several different processing facilities. More specifically, some lockbox processors may have various regional facilities so that the processing facilities are closer to various areas so that the mail does not have to travel as far to get to the lockbox facility. For instance, a lockbox processor may have a facility on the west coast, a separate facility on the east coast, and a separate facility in the mid-west. In this way, a payment would not be delayed while it is transported through the mail system from a payor on the west coast to a processing facility on the east coast. Instead, the payors are given the address for the closest facility. In this way, a lockbox customer may receive payments each day at each of the three processing facilities. To consolidate the payment information for the customers, the payment information for each customer may be combined for all of the processing facilities so that the reports include payment information for the payment received during a particular time frame at all of the processing facilities. In addition, preferably information regarding the image data is centralized so that the image data can be readily retrieved by the customer regardless of which regional facility processes a payment.

ID Assist Document Tracking System

As an operator processes documents, the operator may notice characteristics of various documents that would affect the processing of the document or transaction. Since the system is configured to process a wide variety of documents, there may be numerous characteristics that could affect how a document is processed. Therefore, the system provides an interface that allows the operator to input information about numerous characteristics of a document.

For instance, preferably, the system display is a touch screen that includes a menu option for identifying a document characteristic. By pressing on the menu option, a separate menu appears on the display listing various document characteristics. The operator can select the appropriate document characteristic from the display. Depending on the characteristic selected, a further submenu may be displayed on the screen to further identify information about the document.

After inputting the appropriate document information, the operator drops the document onto the conveyor. In order to ensure that the document is properly identified with the selected characteristic, the timing of the document identification is done in a particular order. First, the operator waits until all of the preceding documents have been processed. Specifically, before identifying a characteristic for a document, the operator waits until all of the previously dropped documents are conveyed to the feeder. Further still, the operator may wait until all of the documents are fed into the scanner from the feeder before completing the steps to identify the document characteristic.

After ensuring that the preceding documents are conveyed from the transport, the operator completes the selection of the appropriate document characteristic using the touch screen. The operator then drops the document onto the transport and the document is conveyed to the feeder. The document is electronically tagged as having the characteristic identified by the operator, and is then processed accordingly.

Utilizing this method, the system can be used to identify a variety of document characteristics, and process the documents accordingly. Specifically, the method can be utilized to identify any characteristic that can be defined with a yes or no response. For example, a characteristic may be “is the document printed in a landscape orientation?” Accordingly, the system can be used to identify features such the following:

-   -   Color—The operator can identify documents that should be scanned         in color. In some jobs, the default scan may be black and white         or gray scale. If the operator identifies a document for color         scanning, the document is scanned in color rather than black and         white or gray scale.     -   Color dropout—The operator can identify documents that should be         scanned in color, but with a particular color dropped out from         the scan. As part of the set-up for a job, the operator selects         the color that should be dropped from the scan.     -   Transaction boundary—The operator can identify a document as a         transaction boundary. For instance, an operator can identify a         document as being the last document in a transaction. Subsequent         documents will be identified in a separate transaction.     -   Automatic rotation—The operator can identify documents that need         to be rotated, such as documents that are in landscape         orientation.     -   Page-type determination—The operator can identify the document         type, particularly if two different types of documents have         similar physical attributes. For instance, a job may have two         document types that are virtually identical in size, such as a         check and a money order. The page-type determination can be used         to distinguish a money order from a check, so that the document         images can be scanned appropriately and the documents can be         sorted separately, if desired.

This list of document features illustrates the different characteristics that can be identified by the operator. In addition, numerous other characteristics can be identified for different type of documents and different applications. Accordingly, the above list is not an exhaustive list of all of the features that can be used to tag documents for different processing.

Although the foregoing methodology illustrates the flexibility of operation of the system, it may be desirable to improve the flow rate of documents through the system. In particular, the system can be configured to manually identify and electronically tag documents with certain characteristics more rapidly than described above.

Turning now to FIGS. 19-20, a document tracking feature referred to as ID assist is illustrated. The document tracking feature may include a drop area that is designated to receive documents having a particular characteristic. If a document is dropped in the identification drop area, the document is electronically tagged as having the pre-determined characteristic and is processed accordingly.

More specifically, the system 10 includes an ID drop area 104 adjacent the main drop area 101. In the present instance, the ID drop area 104 is located to the right of the main drop area 101. An ID sensor 350 positioned in the ID drop area 104 detects the presence of a document dropped onto the conveyor 100 in the drop area. The ID sensor 350 may be any of a variety of types of sensors. In the present instance, the ID sensor 350 is an infrared sensor positioned adjacent the conveyor 100. The ID sensor 350 is located so that the sensor is blocked when a document is dropped in the ID drop area 104. For instance, the ID sensor 350 may be positioned about 2″ from the right hand edge of the conveyor 100.

In addition to the ID sensor 350, the system includes an intermediate sensor 355 positioned adjacent the main drop area 101, between the ID sensor 350 and the entry feeder 110. Further, the ID assist feature utilizes a sensor adjacent the entry feeder 110 or the imager 150. In the present instance, one of the feeder exit sensors 135 is utilized. The intermediate sensor 355 is operable to detect the leading and trailing edges of a document as the document is conveyed from the ID drop area 104 to the image entry feeder 110. Since the speed at which the conveyor conveys the document is known, the length of the document can be determined based on the elapsed time between when the leading and trailing edges of the document pass the intermediate sensor 355. The intermediate sensor 355 is positioned downstream from the ID sensor 350 and is spaced from the ID sensor a distance greater than the length of the longest document that the system is configured to process in a normal job. In the present instance, the intermediate sensor is positioned 14″ to the left of the ID sensor, which in the present configuration positions the intermediate sensor so that it is right in front of the area where the operator is normally seated. In addition, as shown in FIG. 19, the three sensors 350, 355 and 135 are positioned adjacent the guide rail, so that the sensors are operable to detect documents that are on the conveyor against the guide rail.

The system 10 may also include an indicator light 270 in the form of an LED. The indicator light is used to indicate whether or not the ID drop area is ready (i.e. a document can be dropped in the drop area 104). In the present instance, the indicator light 270 is controlled so that the indicator light is on when the ID drop zone is ready and it is off when the ID drop zone is busy. However, the indicator light can be controlled so that it is on only when the system is busy (i.e. if the indicator light is on a document should not be dropped in the ID drop zone).

Based on the signal received from the sensors 350, 355 a time window is created for the document. A document arriving at the feeder exit sensor 135 during the time window is presumed to be the document that was dropped in the ID drop area 104. Documents arriving at the feeder exit sensor 135 before or after the time window are presumed to be documents other than the document that was dropped in the ID drop area. Therefore, the operator need not wait for the conveyor to clear before dropping a document in the ID drop zone 104 to identify the document characteristic. In addition, the operator need not wait for the document being identified to arrive at the imager 150 before dropping a subsequent document.

Referring to FIG. 20, the tracking of the documents will be described in greater detail. In step 410, a document is placed on the conveyor in the ID drop area 104 so that the document blocks the ID sensor 350. In response, the blocked sensor 350 sends a signal to the system controller, which in turn sends a signal to turn off the LED indicator light 270 at step 415. When the indicator light is off, the operator should not drop a subsequent document into the ID drop area. The conveyor 100 conveys the dropped document toward intermediate sensor 355. After the document is dropped, the system monitors the ID sensor 350 to detect the trailing edge of the dropped document, as shown in step 420. If the trailing edge is not detected within a pre-determined time frame, the system declares a jam, as shown in step 424.

Referring again to step 420, when the trailing edge of the document passes the ID sensor 350, a signal is sent to the system controller indicating that the trailing edge of the document has passed the ID sensor. In response to the trailing edge of the document passing the ID sensor 350, the indicator light 270 is turned on (426) indicating that the operator can drop a subsequent document in the ID drop area. In addition, a time is set to open a time window and the system is set to look for a document at the intermediate sensor 355, as shown in step 430. During the time window, the system is expecting a document to arrive at the intermediate sensor 355. If a document arrives at the intermediate sensor 355 before the beginning of the time window or after the close of the time window, the system assumes that the document at the intermediate sensor is not the document that was dropped in the ID drop area.

The time window corresponds to the amount of time that the document should take to be conveyed to the intermediate sensor 355. For example, if the conveyor operates at 14″ per second, and the intermediate sensor is 14″ to the left of the ID sensor 350, the trailing edge of the document should pass the intermediate sensor 355 one second after the trailing edge passes the ID sensor 350. Therefore, the window should correlate to one second. If the conveyor operates at a slower rate, the window should be longer, and if the conveyor operates at a faster rate, the window could be shorter.

Although the time window corresponds to the amount of time that trailing edge of the document should take to get from the ID sensor to the intermediate sensor, typically the window is set to a longer time than the expected time. The longer time provides for possible delays that can occur as the document travels from the ID sensor to the intermediate sensor. For instance, there can be slippage between the document and the conveyor, especially when the document is first dropped onto the conveyor. Similarly, if the operator presses against the documents or does not completely release the document when the document is placed on the conveyor, the document can be delayed. Accordingly, the time window is set so that the time that the system expects the trailing edge of the document to pass the intermediate sensor 355 may be longer than the time that it could take for a document to travel from the ID sensor to the intermediate sensor under ideal circumstances.

In this way, the operator typically does not need to wait for other documents to be dropped before dropping a document in the ID drop area. Similarly, after a document is dropped in the ID drop area, the operator need not wait for the dropped document to be processed before processing further documents. The operator simply needs to ensure that further documents are not dropped onto the conveyor within an area and time corresponding to the time window for a document dropped in the ID drop area. For example, a mis-identification can occur if the operator drops a first document in the ID drop area, then picks up the document and quickly drops a second document so that the trailing edge of the second document passes the intermediate sensor 355 within the time window of the first document. By using the time window to track the document dropped in the ID drop zone, the system allows the rapid identification of select documents without significant interruption in the flow of documents.

In step 440 the intermediate sensor 355 senses the leading edge of the document. In response to a signal from the sensor indicative of the leading edge, the system controller stores the time that the leading edge arrived at the intermediate sensor (step 442). Additionally, the system opens the time window set in step 430, as shown in step 444. Subsequently, at step 450, if the intermediate sensor 355 senses the trailing edge of the document, the system controller stores the time that the sensor sensed the trailing edge (step 452). At step 455 the system evaluates whether the system was looking for a document to be identified. Specifically, as discussed above, when the ID sensor 455 is blocked, the system is set to look for a document to be identified at step 430. If the system is set to look for a document to identify, then the analysis proceeds to step 460. If the system is not set to look for a document to identify, then the document at the sensor is not tagged as a document to be identified.

At step 460, if the document is set to look for a document to identify, then the system determines whether the document arrived within the time window set in step 430. Specifically, at step 430 the time window is set, and at step 442 the time window is opened. If the time recorded at step 452 comes within the time window, then the system determines that the document detected is the document that was dropped in the ID drop area 104. The system then starts a timer with a time corresponding to when the document is expected to arrive at the feeder exit sensor 135 (step 464).

At step 462 the length of the document is determined based on the time that the leading edge is detected (step 442), the time that the trailing edge is detected (step 452), and the speed at which the conveyor transports the documents, which is known. Based on the time that the leading edge was detected (step 442), the speed of the conveyor (known) and the distance from the intermediate sensor 355 to the feeder exit sensor 135 (a known system characteristic), the system determines when the leading edge of the document should arrive at the feeder exit sensor 135, and starts a timer at step 464.

At step 470, the system detects the leading edge of a document. Since the operator may not have waited to process subsequent documents after dropping a document in the ID drop area, the document at the feeder exit sensor may not be the document dropped in the ID drop area. Accordingly, when the feeder exit sensor 135 detects the leading edge of a document, the system records the time at step 472. At step 480, the system analyzes the data to determine if the time recorded at step 472 correlates to the time set at step 464. If it does, the document is tagged as the document dropped in the ID drop area at step 480. If the time recorded at step 472 does not correlate to the time set at step 464, then the document is processed as a default document rather than as a document that was to be tagged as an identified document.

As described above, the system tracks a document as it is conveyed from the ID drop area 104 to the imager 150. Additionally, unless a document is detected by each of the three sensors 350, 355, 135 the system simply treats the document as a default document and processes the document according to default settings. Further, even if a document is detected by all three sensors, the document is not tagged as an identified document unless the document passes the intermediate sensor 355 and the feeder exit sensor 135 during appropriate time frames. In this way, the system is able to monitor a document that is to be tagged as an identified document, while continuing to process documents that are not to be identified.

Further still, if a document is erroneously dropped in the ID drop area 104, the operator can simply pick up the document at any point along the conveyor before the document reaches the entry feeder 110. More specifically, the system tracks documents at the sensors 355, 135 to determine whether the documents are documents that were dropped in the ID drop zone by monitoring whether the documents arrived at the respective sensor during a time frame corresponding to when the system expects a document to arrive at the respective sensor from the ID sensor 350. If a documents arrives at one of the sensors at a time that does not correspond to a relevant time frame for a document dropped in the ID drop zone, the system simply treats the document as a default document and not one to be tagged as being identified. Therefore, if a document is erroneously dropped in the ID drop zone 104, the operator can simply pick the document back up and re-drop the document in the normal drop zone 101 after a short delay to ensure that the document does not pass the intermediate sensor and the feeder exit sensor 135 during the appropriate time windows.

System Applications

As discussed above, the document ID method has numerous applications for different document characteristics. Typically, prior to processing a batch of documents a document ID characteristic is selected for the job as a job parameter. Documents identified during the job will be tagged as having the characteristic and processed accordingly. There may be several characteristics that need to be identified while processing a job, therefore, the operator may select the characteristic that is expected to occur most frequently.

After selecting the ID characteristic, the operator processes documents by dropping the documents on the conveyor 100. Documents that are to be tagged as having the selected characteristic are dropped in the drop area 104. As discussed above, documents dropped in the drop area are tracked by the system and tagged accordingly.

One example of the system is using the ID assist to identify transaction boundaries. To do so, the job is set up so that the ID characteristic is set to either identify the beginning or end of a transaction. In the following example, the characteristic is set to identify the beginning of the transaction. Accordingly, as an operator is processing documents, the operator can manually identify the documents in a transaction because the operator removes the documents from an envelope (as described previously) or the documents may be stacked in such a way that the operator is able to determine the boundaries for a transaction.

To process a transaction, the operator drops the first document in a transaction onto the ID drop area 104. The operator then waits until the first transaction document passes the intermediate sensor 355 to ensure that subsequent documents do not get associated with a different transaction. The operator then drops the remaining document in the first transaction onto the main drop area 101. After dropping the final document in the first transaction, the operator drops the first document of the second transaction onto the ID drop area 104 to identify the start of the second transaction. The operator waits until the first document of the second transaction passes the intermediate sensor before dropping the rest of the documents in the second transaction onto the main drop area 101. In other words, after the first document of the second transaction is dropped onto the ID drop area 104, the operator drops the subsequent document(s) in the second transaction to ensure that the subsequent document(s) trail the first document in the second transaction.

Assuming that the first two documents dropped onto the ID drop area progress through the system so that the documents reach the intermediate sensor 355 and the feeder exit sensor 135 during the appropriate time windows, the two documents will be identified as the first documents in the first two transactions. The system 10 will process the documents so that the first ID assist identified document and all of the documents between the first and second ID assist identified document are associated together as the first transaction in the job. Additionally, the second ID assist identified document and all of the documents between the second ID assist identified document and the next ID assist identified document are associated together as the second transaction in the job. In this way, the documents in a job can be readily associated and separated into the proper transactions.

Although the system has been described as operating to automatically tag the document as having a pre-determined characteristic, the system may be used in certain instances to identify further information through operator prompt. For instance, dropping a document in the ID drop area 104 may operate to automatically prompt the user for further information. For example, in response to sensing a document in the ID drop area 104, the system may automatically open a particular menu having a variety of selections, such as page-types. The user then selects the appropriate choice from the selection and the system then processes the document as discussed above. If the document arrives at the intermediate sensor 355 and the feeder exit sensor 135 within the appropriate time frames, the document is tagged with the information selected by the user and processes the document accordingly.

Additionally, the ID sensor can be used as a more generalized indicator that a characteristic is to be selected for the document. Therefore, instead of having to wait for the transport to be cleared and then indicate that a document characteristic is to be identified for the document, and then selecting the characteristic, the ID sensor may cause a general menu to automatically open up allowing the operator to select the characteristic and have it applied to the document. In this way, the system can operate in a semi-automatic mode to speed document processing while allowing the operator to input various selection criteria for the documents.

As discussed above, the ID drop feature allows the system to automatically tag a document with a particular characteristic. In addition, the system can use the ID information in combination with other information determined during processing to identify how the document is to be processed. For instance, during processing, the system can be used to determine various information about a document, such as length, width and presence of an OCR or MICR line. This information can be combined with the ID assist to determine how the document should be processed. By way of example, the system may determine that a document is a check or money order in response to the measured length and width of a document and the presence of a MICR line in a pre-determined area. To distinguish between a check and a similarly sized money order, the system may require that the money order by identified by the ID assist as described above. Therefore, to identify a document as a money order, the document must have the appropriate characteristics (length, width, MICR line) along with having been identified by the ID assist. If the document was not identified by the ID assist, the system will identify the document as a check based on the length width and MICR line information.

By way of another example, the ID assist feature can improve the processing of color documents. When processing documents, it may be desirable to save certain portions or snippets of document image data in color rather than (or in addition to) converting the entire image data file to gray scale or black and white images. The system may determine that a document should have snippets saved in color based on the document being dropped in the drop area 104. However, the system 10 may still need to identify the page-type to determine the locations for the snippets to be saved in color. For instance, the system may need to determine whether the document is an 8½×11 document in portrait or landscape orientation, or whether the document is a check or a payment stub.

The system 10 may determine the page-type computationally by evaluating various features detected by the system (such as height, width, MICR line, OCR line, font orientation or other characteristics). Alternatively, the ID assist features may be configured to identify the page-type. In other words, if a document is dropped in the drop area, the document is tagged as a document having a particular page-type that should have certain portions of image data scanned and saved in color. In such a set-up, the ID assist may force the system to identify the document as a particular document type regardless of other features of the document.

Alternatively, when used in combination with an operator prompt, the system may identify the page-type by operator input. Specifically, the job may be set-up so that the ID characteristic is a color scan, and so that dropping a document in the ID drop area prompts the operator to input the page-type, such as by selecting from a menu list on the touchscreen display. In this way a document may be tagged to be scanned in color and the page-type is also identified so that the system can automatically scan and save the appropriate portions of the document in color based on the identified document type.

The ID assist feature can be also used to override default determinations that the systems makes based on certain characteristics. For instance, for color scanning, the system may be set up so that prior to scanning a document in color, the operator drops the document in the ID drop area 104 to identify the document as one that is to be scanned in color. When processing the document, the system automatically determines the page-type for the document so that the proper areas are scanned in color. As discussed previously, the system may use various document characteristics to determine the page-type, including the length and width of the document, the presence of an OCR line, a MICR line in a pre-determined area, and other features. If there are numerous page-types in a job, the page-type determination can be computationally intense.

Although the job can be set-up to prompt the operator to input the page-type when a page is dropped in the ID drop area 104, the job can be set-up to process the documents more efficiently. Specifically, if only certain page-types that are to be scanned in color are expected for a job, the job can be set-up so that only such page-types are considered when performing an automated page-type determination. For example, a job may have 20 different page-types, but only five page-types to be scanned in color are anticipated for the job. When setting up the job, the ID assist is set so that the five anticipated page-types are set to require ID assist identification, while the remaining 15 page-types are set to require that the document not be identified using ID assist. In this way, if a document is identified by ID assist, the system will scan the document (or portions) in color. In addition, the system will determine the page-type for the document automatically. Since the document was identified using ID assist, the system will evaluate the page and compare it against only the 5 page-types set-up for the job, rather than comparing the document characteristics against all of the possible page-types for a job.

If the operator encounters a document in a job that is to be scanned in color, but the document has a page-type that is different from the 5 page-types set-up for the job, the operator can still process the document so that it is scanned in color. However, the operator should process the document without the ID assist. In other words, the operator will manually identify the document as one to be scanned in color, such as by making one or more selections on the touchscreen display. Further, since the 5 page-types were set to require ID assist identification, the system need only compare the document against the remaining 15 page-types when making an automated page-type determination. By limiting the page-types to be used in the automated page-type determination, the ID assist may improve the efficiency of the system, even when the ID assist feature is not used for a particular document. 

1. An apparatus for processing documents, comprising: a conveyor having a generally horizontal surface configured to receive and convey documents that are manually dropped onto the conveyor in a generally horizontal orientation; a scanner operable to receive documents from the conveyor and scan the documents to obtain image data for the documents; and a feeder configured to receive a plurality of documents from the conveyor in a generally horizontal orientation and serially feed the documents toward the scanner; and a flattener for automatically flattening folds of the documents prior to the documents entering the scanner.
 2. The apparatus of claim 1 wherein the flattener is positioned between the conveyor and the scanner.
 3. The apparatus of claim 1 wherein the flattening element is operable to flatten folds in the documents along substantially the entire length of the documents prior to the documents entering the scanner.
 4. The apparatus of claim 1 wherein the flattening element comprises: a gripping portion for positively entraining a first end of each of the documents; and an elongated low friction portion to allow portions the documents to slide relative the first end as the documents pass through the flattening element.
 5. The apparatus of claim 1 wherein the flattening element is configured to flatten the width of the document without substantially buckling the document.
 6. The apparatus of claim 1 comprising a non-volatile image storage medium for storing the image data for the documents.
 7. The apparatus of claim 1 comprising a re-orientation element operable to re-orient the documents as the documents are conveyed toward the scanner in a generally horizontal orientation.
 8. The apparatus of claim 7 wherein the re-orientation element is operable to re-orient the documents without nipping the documents.
 9. The apparatus of claim 1 wherein the documents are contained within an envelope and the apparatus comprises an opener operable to open the envelope so that the documents can be removed from the envelope.
 10. The apparatus of claim 1 wherein the feeder has a feed nip for nipping the documents, and the feeder is configured to receive documents having a leading edge folded upwardly so that the leading edge is positioned above the feed nip, wherein the feeder is configured to drive the leading edge of such folded documents downwardly toward the feed nip.
 11. The apparatus of claim 1 wherein the conveyor comprises a substantially horizontal conveyor belt or a plurality of rollers configured to support and convey the documents in a generally horizontal orientation.
 12. The apparatus of claim 1 wherein the conveyor is configured to convey the documents to the feeder without nipping the documents.
 13. The apparatus of claim 1 wherein the documents are contained within envelopes, and the apparatus comprises an opener operable to open an envelope to facilitate removal of documents from the envelope.
 14. The apparatus of claim 13 wherein the opener comprises: an input bin for receiving a stack of envelopes containing documents; a feeder for feeding an envelope from the stack of envelopes; a cutter operable to cut at least one edge of the envelope; and an extractor operable to open the edge-severed envelope by pulling apart the envelope to expose the documents for manual removal by an operator.
 15. The apparatus of claim 1 wherein the feeder is positioned adjacent an end of the conveyor, such that the conveyor conveys documents dropped onto the conveyor to the feeder.
 16. The apparatus of claim 1 wherein the documents are fed along a document path from the feeder to the scanner.
 17. An apparatus for processing documents, comprising: a conveyor having a generally horizontal surface configured to receive and convey documents that are manually dropped onto the conveyor in a generally horizontal orientation; a scanner operable to receive documents from the conveyor and scan the documents to obtain image data for the documents; and a feeder configured to receive a plurality of documents from the conveyor in a generally horizontal orientation and serially feed the documents toward the scanner; and a re-orientation element operable to re-orient the documents as the documents are conveyed toward the scanner in a generally horizontal orientation.
 18. The apparatus of claim 17 wherein the re-orientation element is operable to re-orient the documents without nipping the documents.
 19. The apparatus of claim 17 wherein the re-orientation element comprises a surface disposed an at acute angle relative to the direction of travel of the conveyor.
 20. The apparatus of claim 19 wherein the conveyor is operable to drive documents in a generally horizontal orientation toward the re-orientation element to orient an edge of the documents against the surface of the re-orientation element.
 21. The apparatus of claim 17 wherein the re-orientation element comprises an elongated element having a substantially straight edge. 